Novel high efficiency library-identified aav vectors

ABSTRACT

Aspects of the disclosure relate to barcoded chimeric adeno-associated virus (AAV) capsid libraries, chimeric capsids and related recombinant AAVs (rAAVs) identified using the libraries. Specifically, the chimeric AAV capsid libraries comprise a plurality of nucleic adds encoding AAV capsid proteins, wherein each nucleic acid (i) encodes a unique AAV capsid protein having distinct polypeptide regions of greater than six amino acids in length that are derived from at least two different AAV serotypes, and (ii) comprises a unique barcode sequence. Further disclosed are methods of preparing an AAV library and identifying AAV capsids tropic for a target tissue.

RELATED APPLICATIONS

This application is a continuation under 35 U.S.C. § 120 of U.S. application Ser. No. 15/516,585, filed Apr. 3, 2017, entitled, “NOVEL HIGH EFFICIENCY LIBRARY-IDENTIFIED AAV VECTORS”, which is a National Stage Application of PCT/US2015/053804, filed Oct. 2, 2015, entitled “NOVEL HIGH EFFICIENCY LIBRARY-IDENTIFIED AAV VECTORS”, which claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application Ser. No. 62/059,769, filed on Oct. 3, 2014, entitled, “NOVEL HIGH EFFICIENCY LIBRARY-IDENTIFIED AAV VECTORS”, the entire content of each application which is incorporated herein by reference.

FEDERALLY SPONSORED RESEARCH

This invention was made with government support under grant no. R01 NS066310 awarded by the National Institutes of Health. The government has certain rights in the invention.

FIELD

In some aspects, the disclosure provides novel AAV capsid sequences and methods of use thereof as well as related kits.

BACKGROUND

Effective gene transfer is important for development of new therapies for certain diseases (e.g., neurological diseases). AAV vectors have emerged as an effective platform for in vivo gene transfer. However, a need remains for new vectors for gene delivery.

SUMMARY OF INVENTION

The disclosure relates to compositions, kits and methods of useful for identifying adeno-associated viruses (AAVs). In some embodiments, the disclosure relates to identification of AAV capsid genes with useful targeting properties. For example, in some embodiments, AAV capsid genes are provided that have been identified using a chimeric AAV library. In some embodiments, the chimeric AAV capsid library comprises a plurality of nucleic acids encoding AAV capsid proteins. In some embodiments, each nucleic acid encodes a unique chimeric AAV capsid protein. In some embodiments, each nucleic acid encodes a unique chimeric AAV capsid protein having distinct polypeptide regions of different AAV serotypes (e.g., at least three serotypes). In some embodiments, each nucleic acid comprises a unique barcode sequence. In some embodiments, barcodes sequences in the library have the formula (NNM)_(n). In some embodiments, n is an integer in the range of is 3 to 20. In some embodiments, N is independently selected from A, G, T and C. In some embodiments, M is independently A or C.

In some embodiments, the barcoded chimeric AAV capsid library comprises a plurality of nucleic acids, wherein each nucleic acid (i) encodes a unique AAV capsid protein having distinct polypeptide regions of greater than six amino acids in length that are derived from at least two different AAV serotypes; and (ii) comprises a unique barcode sequence. In some embodiments, the nucleic acid sequences encoding the chimeric AAV capsid proteins comprise capsid sequence fragments from at least 2 different AAV serotypes.

In some embodiments, the barcode sequence comprises a single contiguous stretch of nucleotides inserted into an untranslated region (UTR) (e.g., the 5′ or 3′ UTR) of the nucleic acid sequence encoding the chimeric AAV capsid protein, thereby allowing identification of the capsid protein by detection of the barcode sequence.

In some embodiments, the barcode sequence of the barcoded chimeric AAV capsid library is not interrupted by a primer sequence or other sequence. In some embodiments, each barcode sequence of the barcoded chimeric AAV capsid library does not comprise a restriction endonuclease cleavage site.

In some embodiments, each unique barcode sequence has a length of between about 10 nucleotides and about 50 nucleotides. In some embodiments, each unique barcode sequence has a length of between about 20 and about 40 nucleotides. In some embodiments, each unique barcode sequence has a length of 30 nucleotides.

In some embodiments, the disclosure relates to a chimeric AAV capsid protein identified by the methods disclosed herein. In some embodiments, the chimeric AAV capsid protein comprises distinct polypeptide regions derived from at least two, at least three, at least four, at least five or more different AAV serotypes selected from the group consisting of: AAV1, AAV2, AAV4, AAV5, AAV6, AAV8, AAV9, AAVrh8, AAVrh10, AAVrh39, and AAVrh43. In some embodiments, the chimeric AAV capsid protein comprises distinct polypeptide regions of greater than four, greater than five, greater than six, greater than seven or more amino acids in length that are derived from different AAV serotypes (e.g., at least two) selected from the group consisting of: AAV1, AAV2, AAV4, AAV5, AAV6, AAV8, AAV9, AAVrh8, AAVrh10, AAVrh39, and AAVrh43. In some embodiments, the chimeric AAV capsid protein comprises the amino acid sequence represented by SEQ ID NO: 5-8 or SEQ ID NO: 13-16. In some embodiments, the chimeric AAV capsid protein comprises an amino acid sequence encoded by a nucleic acid sequence as set forth in SEQ ID NO: 1-4 or SEQ ID NO: 9-12. In some embodiments, isolated nucleic acids encoding such capsid proteins are provided.

In some embodiments, the disclosure relates to a rAAV comprising the capsid identified by the methods disclosed herein. In some embodiments, the rAAV selectively targets the CNS tissue or another tissue of a subject. In some embodiments, the rAAV further comprises a transgene (e.g., a protein coding gene or functional RNA encoding gene)

In another aspect, the disclosure relates to methods of producing a chimeric AAV capsid encoding nucleic acid library. In some embodiments, the method of producing a chimeric AAV capsid encoding nucleic acid library comprises preparing a set of chimeric AAV capsid encoding nucleic acids. In some embodiments, the methods comprise preparing a set of constrained, randomized barcode nucleic acids such that number of unique barcode nucleic acids is greater than the number of chimeric AAV capsid encoding nucleic acids. In some embodiments, the methods comprise combining each chimeric AAV capsid encoding nucleic acid with a unique barcode nucleic acid (e.g., of a set prepared according to methods disclosed herein).

In some embodiments, a method of generating a chimeric AAV capsid encoding nucleic acid library is provided. In some embodiments, the method comprises shuffling AAV capsid encoding nucleic acids to obtain a set of chimeric AAV capsid encoding nucleic acids. In some embodiments, each nucleic acid of the set encodes a unique chimeric AAV capsid protein having distinct polypeptide regions of at least three different AAV serotypes. In some embodiments, the methods further comprise tagging each chimeric AAV capsid encoding nucleic acid with a unique barcode nucleic acid from a set of constrained, randomized barcode nucleic acids.

In some embodiments, the disclosure relates to a method of preparing an AAV library, the method comprising: (i) transfecting AAV packaging cells with a chimeric AAV capsid encoding nucleic acid library of disclosed herein; and/or (ii) maintaining the cells under conditions that permit production of AAVs incorporating chimeric AAV capsids; and/or(iii) preparing an AAV library comprising AAVs produced in step (ii).

In some embodiments, the disclosure relates to a method of identifying AAV capsids tropic for a target tissue. In some embodiments, methods are provided that comprise administering to a subject AAVs of an AAV library produced by a method disclosed herein. In some embodiments, the methods comprise isolating nucleic acids from cells of the target tissue. In some embodiments, the methods comprise detecting AAV capsid encoding nucleic acids in the isolated nucleic acids. In some embodiments methods of identifying AAV capsids tropic for a target tissue, further comprises amplifying the nucleic acids isolated from the target tissue to facilitate detection of the AAV capsid encoding nucleic acids. In some embodiments, methods further comprises confirming that the AAV capsid encoding nucleic acids are from the library by detecting barcode sequences. In some embodiments, barcode sequences are detected by colony hybridization. In some embodiments, barcode sequences are detected by deep sequencing. In some embodiments, barcode sequences are detected by PCR. In some embodiments, PCR amplifies only the barcode sequence. In some embodiments, PCR amplifies the at least a portion of a capsid encoding sequence and a barcode sequence.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 provides a schematic representation of an approach for in vivo selection of AAV capsid libraries by using unique barcodes in the 3′UTR of each clone in the library and deep sequencing to determine their frequency in target cells/tissue followed by PCR amplification of capsid/barcode pairs from the plasmid library.

FIG. 2 depicts a sequence composition of AAV capsid clones in new barcoded library. Note that all amplified AAV genomes are chimeric.

FIGS. 3A-3C depict barcode distribution after cell culture and in vivo selection of AAV capsid library. Left panel—Selection of AAV library in human U87 glioma cells at decreasing gc/target cell. Right panel—Different doses of AAV library were infused via the tail vein into 6-8 week-old C57BL/6 mice and barcodes PCR amplified from genomic DNA isolated 3 days later. Shown are the results for barcode frequency in liver from mice infused with 5E11 or 1E9 gc, and brain barcodes from the same mouse infused with 5E11 gc.

FIG. 4 depicts brain-selected capsids which are chimeric and have more translational homology than capsids pulled from liver. The colors refer to the AAV parental origin at the nucleotide level. Multiple AAV parental origins in a single cap indicate chimerism. The amino acid sequences of brain-selected capsids (B1-B4) and liver selected capsids were aligned separately amongst themselves. A non-homologous amino acid is represented by a single black bar. Homology among capsids is indicated by a relative absence of black bars.

FIG. 5 depicts a biodistribution profile of rAAV-B1-B4 vectors which indicate extensive CNS transduction. GFP expression in the brain, spinal cord and liver was analyzed at 4 weeks post-intravenous injection in adult C57BL/6 mice. Detection of GFP expression in the brain and spinal cord was enhanced by immunofluorescence staining with an anti-GFP antibody (20× magnification). Native GFP fluorescence was used for liver sections (5× magnification). GFP transduction is present in CNS endothelia and glia for AAV9, B3 and B4. B1-derived rAAV vector exclusively transduces endothelia. CNS transduction for B2-B4-derived rAAV vectors is identical to AAV9. Liver transduction for B1- and B2-derived rAAV vectors appears to be considerably lower than for rAAV9, B3- or B4-derived rAAV vectors.

FIG. 6 depicts data showing extensive transduction of brain and spinal cord by rAAV-B1-B4 vectors. rAAV-B1-derived rAAV vector transduces the CNS endothelium at high efficiency. rAAV-B2-B4 also transduce brain endothelial cells and glia at high efficiency. rAAV-B3 and rAAV-B4 also transduce glia at high efficiency.

FIG. 7 depicts data showing that infusion of a higher dose (2E12 vg) of rAAV-B1 and rAAV-B2 results in increased transduction of cortical and hippocampal neurons, endothelia and astrocytes when compared to AAV9.

FIG. 8 depicts data showing liver transduction efficiency for rAAV-B1 and rAAV-B2 vectors is dramatically lower than AAV9, as observed by fluorescence imaging.

FIG. 9 depicts data showing biodistribution of rAAV-B1, rAAV-B2, and AAV9 vectors as analyzed by quantitative PCR (qPCR) of vector genomes in genomic DNA from different tissues. Analysis was performed 4 weeks post i.v. infusion of scAAV-CBA-GFP vector via the tail vein in adult C57BL/6 mice at a dose of 5E11 vg. Significantly more vector genomes are found in the cerebrum, cerebellum and spinal cord of rAAV-B1, compared to AAV9 (9-fold, 15-fold and 6-fold, respectively). However, both rAAV-B1 and rAAV-B2 have significantly lower numbers of vector genomes in the liver in comparison to AAV9 (4-fold and 3-fold, respectively).

FIG. 10 depicts data showing biodistribution of rAAV-B1, rAAV-B2, and AAV9 vectors as analyzed by western blot. More GFP protein was detected in the cerebrum of adult C57BL/6 mice injected with rAAV-B1 and rAAV-B2, compared to AAV9. Less GFP protein was detected in the liver of rAAV-B1 and rAAV-B2-injected mice, in comparison to AAV9.

FIG. 11 depicts data showing that rAAV-B1 vector transduces peripheral tissues to a greater extent than AAV9, as determined by qPCR. Significantly more vector genomes are found in the forelimb muscle, hind limb muscle, heart and pancreas of both rAAV-B1 and rAAV-B2-injected mice, as compared to AAV9. The extent of transduction of rAAV-B1 is greater than rAAV-B2 for each of these tissues. In addition, rAAV-B1 had significantly more vector genomes in kidney and lung in comparison to AAV9.

FIG. 12 depicts data showing that rAAV-B1 vector transduces peripheral tissues to a greater extent than AAV9, as determined by immunofluorescence microscopy.

FIG. 13 depicts data showing effects of rAAV-B1-GFP and rAAV-B2-GFP vectors that were infused into 4-6 week-old kittens (n=1/vector) via the carotid artery at a dose of 3.5E12 vg. Neuronal and astrocytic populations (identified by morphology) were found to be transduced in both cat brains. rAAV-B2 has pronounced astrocytic transduction. Loci with high density of neurons were found transduced in rAAV-B1 cat brain.

FIG. 14 depicts a schematic of a screen to identify capsid-barcode pairs. Briefly, the entire capsid-barcode pool is amplified from DNA isolated from the viral library, using universal cap-flanking primers. This pool is then cloned into a plasmid to create a cap-barcode library. This is then transformed and plated at high-density (2.5E⁵ bacteria/filter) on nylon filters placed on 150 mm LB-agar plates. These master filters are then replica-plated, the bacteria lysed on the replica filters and DNA cross-linked by UV. These filters are then hybridized to radioactively-labeled oligos of barcodes of interest. The replica filter is then keyed back to the master filter. Since it is difficult to pick single colonies from the high-density master filter, the areas on the master filter corresponding to spots on replica filter, are cut out and the bacteria on those areas re-plated at lower density. These are then replica filtered and hybridized by regular low-density hybridization to the radiolabeled oligo probe, and subsequently positive single colonies are picked and send for sequencing of entire capsid (and barcode).

FIGS. 15A-15D depict a multiple sequence alignment. The consensus sequence is shown as the top line of the alignment. Variant bases are shaded. The ‘consensus’ strand corresponds to SEQ ID NO: 96. The ‘1. B1’ strand corresponds to SEQ ID NO: 1. The ‘2. B2’ strand corresponds to SEQ ID NO: 2. The ‘3. B3’ strand corresponds to SEQ ID NO: 4. The ‘1. B4’ strand corresponds to SEQ ID NO: 4.

FIGS. 16A-16B depict a multiple sequence alignment. Variant residues are shaded. The ‘1. B1 translation’ strand corresponds to SEQ ID NO: 5. The ‘2. B2 translation’ strand corresponds to SEQ ID NO: 6. The ‘3. B3 translation’ strand corresponds to SEQ ID NO: 7. The ‘4. B4 translation’ strand corresponds to SEQ ID NO: 8.

FIGS. 17A-17D depict a multiple sequence alignment. The consensus sequence is shown as the top line of the alignment. Variant bases are shaded. The ‘1. L1’ strand corresponds to SEQ ID NO: 9. The ‘2. L2’ strand corresponds to SEQ ID NO: 10. The ‘3. L3’ strand corresponds to SEQ ID NO: 11. The ‘4. L4’ strand corresponds to SEQ ID NO: 12.

FIGS. 18A-18B depict a multiple sequence alignment. Variant residues are shaded. The ‘1. L1 translation’ strand corresponds to SEQ ID NO: 13. The ‘2. L2 translation’ strand corresponds to SEQ ID NO: 14. The ‘3. L3 translation’ strand corresponds to SEQ ID NO: 15. The ‘4. L4 translation’ strand corresponds to SEQ ID NO: 16.

FIGS. 19A-19B depict a schematic representation of native capsid contributions to B1. Capsid contributors to B1 are AAV1, AAVrh.8, AAVrh.10, AAV6, AAVrh.43, AAV2, and AAVrh.39. The sequence depicted corresponds to SEQ ID NO: 1.

FIGS. 20A-20C depict a schematic representation of native capsid contributions to B2. Capsid contributors to B2 are AAVrh.39, AAVrh.43, AAV1, AAV6, AAV8, AAVrh.10, and AAVrh.8. The sequence depicted corresponds to SEQ ID NO: 2.

FIGS. 21A-21C depict a schematic representation of native capsid contributions to B3. Capsid contributors to B3 are AAV8, AAVrh.39, AAV1, AAVrh.8, AAV6, and AAVrh.43. The sequence depicted corresponds to SEQ ID NO: 3.

FIGS. 22A-22D depict a schematic representation of native capsid contributions to B4. Capsid contributors to B4 are AAVrh.39, AAV1, AAVrh.43, AAVrh.8, AAV6, and AAV8. The sequence depicted corresponds to SEQ ID NO: 4.

FIGS. 23A-23D show CNS transduction profile of AAV-B1 vector after intravascular infusion in adult mice. FIG. 23A provides an overview of GFP distribution in brains of AAV-B1 and AAV9 injected mice (2×10¹² vg/mouse). Representative images of coronal brain sections located at +0.5 mm, −1.80 mm and −3.00 mm (left to right) in relation to bregma are shown. Transduction of neuronal populations in different CNS regions of AAV-B1 injected mice (2×10¹² vg/mouse) is shown in FIG. 23B. Black arrows indicate examples of GFP-positive neurons identified by morphology. Bar=50 μm. FIG. 23C shows AAV vector genome content in cerebrum, cerebellum and spinal cords (N=4 animals per group) (5×10¹¹ vg/mouse). Age matched non-injected mice were included as controls (not shown). FIG. 23D shows Western blot analysis of GFP expression in cerebrum, cerebellum and spinal cord of 2 animals per group (5×10¹¹ vg/mouse). Signal intensity of GFP was normalized to corresponding β-actin signal intensity for quantitative comparison. **p<0.01, ***p<0.001, ****p<0.0001 by Student's unpaired t-test.

FIGS. 24A-24B show neuronal transduction in cat after systemic delivery of AAV-B1 vectors. Transduction of neurons in the cat brain after systemic delivery of AAV-B1 vector (3.4×10¹² vg). Representative images show GFP-positive cells with neuronal morphology in various structures in the brain. Bar=50 μm.

FIGS. 25A-25F show AAV-B1 biodistribution to mouse liver and muscle after intravascular delivery. Native GFP expression in (FIG. 25A) livers of AAV-B1 and AAV9 injected mice (2×10¹² vg/mouse) and (FIG. 25D) skeletal muscle (triceps and quadriceps), diaphragm and heart of AAV-B1 injected mice (5×10¹¹ vg/mouse). AAV vector genome content (N=4 animals per group) (FIG. 25B, liver; FIG. 25E muscle groups) and Western blot analysis of GFP expression (N=2 animals per group) (FIG. 25C, liver; FIG. 25F muscle groups) are shown (5×10¹¹ vg/mouse). Signal intensity of GFP was normalized to corresponding β-actin signal intensity for quantitative comparison. *p<0.05, **p<0.01, by Student's unpaired t-test.

FIGS. 26A-26E show transduction of mouse beta cells, alveolar epithelium and retinal vasculature after intravenous infusion of AAV-B1. GFP expression in FIG. 26A pancreas, FIG. 26D lung, and FIG. 26G retina of AAV-B1 injected mice (5×10¹¹ vg/mouse). White arrow in FIG. 26A indicates GFP-positive insulin-producing beta cells. Inset in FIG. 26G shows individual GFP-positive blood vessels. AAV vector genome content (N=4 animals per group) (FIG. 26B, pancreas; FIG. 26E lung) and FIG. 26C. Western blot analysis of GFP expression (N=2 animals per group) (FIG. 26C, pancreas; FIG. 26E lung) are shown. Signal intensity of GFP was normalized to corresponding β-actin signal intensity for quantitative comparison. *p<0.05, ****p<0.0001 by Student's unpaired t-test.

FIGS. 27A-27D show biophysical characterization of AAV-B1. FIG. 27A shows a predicted molecular model of AAV-B1 capsid. FIG. 27B shows a surface exposed variable region-IV (VR-IV) of AAV-B1 (left) and AAV8 (right). FIG. 27C shows pooled human IVIg neutralization assay and FIG. 27D shows a CHO cell binding assay of AAV-B1 and AAV9 vectors. Data shown as mean±SEM in FIG. 27C, and as mean±SD in FIG. 27D. Experiment was performed with N=3 biological replicates. *p<0.05 by one-way ANOVA.

FIG. 28 provides a schematic cartoon of one embodiment of a single round in-vivo biopanning strategy

FIGS. 29A-29B show the chimeric nature of packaged viral library. FIG. 30A shows parental capsid contribution to clones 1-5 isolated at random from packaged viral library. FIG. 30B shows homology between the clones at the amino acid level. Grey areas indicate homology; black lines indicate non-homologous amino acids. % homology is calculated for amino acid composition.

FIGS. 30A-30D show the chimeric and homologous nature of brain-resident capsids. Parental capsid contribution to (FIG. 30A) brain-selected capsids AAV-B1, -B2, -B3 and -B4, and (FIG. 30B) 4 liver-resident variants chosen at random. Homology among (FIG. 31C) brain and (FIG. 30D) liver clones at the amino acid level. Grey areas indicate homology; black lines indicate non-homologous amino acids. % homology is calculated for amino acid composition.

FIG. 31 shows a transduction profile of AAV-B1 vector across multiple CNS regions after systemic delivery. Black arrows indicate examples of GFP-positive neurons. Bar=50 μm.

FIGS. 32A-32G show phenotyping of GFP positive cells in CNS after systemic delivery of AAV-B1. Transduced cells were identified by double immunofluorescence staining with antibodies to GFP, pan-neuronal marker NeuN (FIG. 32A, FIG. 32B and FIG. 32D), striatal medium spiny neuron marker DARPP32 (FIG. 32A), dopaminergic neuron marker tyrosine hydroxylase (TH) (FIG. 32B), Purkinje neuron marker calbindin-D-28k (Cal28K) (FIG. 32C), endothelial marker CD31 (FIG. 32E), and mature oligodendrocyte marker APC (FIG. 32F). The large size, morphology and location of GFP-positive neurons in the ventral spinal cord suggest a motor neuron identity. GFP-positive astrocytes (FIG. 32G) were identified based on their morphology. White arrows indicate examples of co-localization. Bar=10 μm.

FIGS. 33A-33E show a comparison of AAV-B1 capsid protein sequence to AAV8 and other natural AAV isolates. The top two lines highlight the similarities and differences between amino acid sequences of AAV8 and AAV-B1, with singleton residue variants relative to AAV8 highlighted in red. The residue highlighted in green indicates a variant residue with no corresponding orthologs in the nine AAV species presented in the alignment. Translation start sites for VP1, VP2, and VP3 are indicated with filled triangles. The conserved parvovirus phospholipase A2 domain (approximately residues 44 to 104) in the VP1 unique region with the conserved AAV calcium-binding motif (Y-X-G-P-G/F) and catalytic residues (H-D-X-X-Y) are indicated with filled rectangles. The secondary structural elements are labeled with the corresponding text and the following symbols: β sheets B, C, D, E, F, G, and I are indicated with a horizontal overlined arrow. The positions of the variable loops (VR), I through IX, are indicated. The position of the conserved α helix is indicated with three parallel horizontal lines. The ‘AAV8’ strand corresponds to SEQ ID NO: 88. The ‘AAV-B1’ strand corresponds to SEQ ID NO: 5. The ‘AAV1’ strand corresponds to SEQ ID NO: 89. The ‘AAV2’ strand corresponds to SEQ ID NO: 90. The ‘AAV5’ strand corresponds to SEQ ID NO: 91. The ‘AAV6’ strand corresponds to SEQ ID NO: 92. The ‘AAV7’ strand corresponds to SEQ ID NO: 93. The ‘AAV9’ strand corresponds to SEQ ID NO: 94. The ‘AAVrh10’ strand corresponds to SEQ ID NO: 95.

FIGS. 34A-34C show a comparison of biodistribution profile of AAV-B1 and AAV8 after systemic delivery (5×10¹¹ vg/mouse). AAV vector genome content (N=4 animals per group) in (FIG. 34A) CNS, (FIG. 34B) liver and (FIG. 34C) skeletal muscle (quadriceps) is shown. *p<0.05, **p<0.01, ***p<0.001, ****p<0.0001 by Student's unpaired t-test.

FIGS. 35A-35C show biodistribution profile of AAV-B1 systemically infused at lower dose (5×10¹⁰ vg/mouse). AAV vector genome content (N=4 animals per group) in (FIG. 35A) CNS, (FIG. 35B) muscle groups and (FIG. 35C) peripheral tissues is shown. *p<0.05, **p<0.01, by Student's unpaired t-test.

DETAILED DESCRIPTION OF INVENTION

In some aspects, provided herein are recombinant AAVs having desirable tissue targeting and transduction properties that make them useful for certain gene therapy and research applications. In some embodiments, the disclosure relates to chimeric AAV capsids that are useful for transducing various target tissues. The disclosure provides in some aspects novel AAV capsid proteins that have been identified from a barcoded chimeric AAV capsid library. As used herein, the term “library” refers to an collection of entities, such as, for example, viral particles (e.g., rAAVs), molecules (e.g., nucleic acids), etc. A library may comprise at least two, at least three, at least four, at least five, at least ten, at least 25, at least 50, at least 10², at least 10³, at least 10⁴, at least 10⁵, at least 10⁶, at least 10⁷, at least 10⁸, at least 10⁹, or more different entities (e.g., viral particles, molecules (e.g., nucleic acids)). In some embodiments, a library entity (e.g., a viral particle, a nucleic acid) can be associated with or linked to a tag (e.g., a barcode), which can facilitate recovery or identification of the entity. For example, in some embodiments, libraries provided herein comprise a collection of barcoded nucleic acids. In some embodiments, a library refers to a collection of nucleic acids that are propagatable, e.g., through a process of clonal amplification. Library entities can be stored, maintained or contained separately or as a mixture.

As used herein, “barcode sequence” refers to a sequence of a biological molecule (e.g., nucleic acid, protein, etc.) that when combined with the sequence another biological molecule serves to identify the other biological molecule. Typically, a barcode sequence is a heterologous sequence that is incorporated within or appended to a sequence of a target biological molecule and utilized as a reference in order to identify a target molecule of interest. In some embodiments, a barcode sequence is a sequence of a nucleic acid (e.g., a heterologous or synthetic nucleic acid) that is incorporated within or appended to a target nucleic acid and utilized as a reference in order to identify the target nucleic acid. In some embodiments, a barcode sequence is of the formula (NNM). In some embodiments, n is an integer in the range of 5 to 20, 5 to 10, 10 to 20, 7 to 20, or 7 to 30. In some embodiments, n is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more. In some embodiments, N are each nucleotides that are independently selected from A, G, T and C and M is a nucleotide that is either A or C. Thus, in some embodiments, the disclosure provides nucleic acids that (i) have a target sequence of interest (e.g., a coding sequence (e.g., that encodes a protein or functional RNA)); and (ii) comprises a unique barcode sequence. In some embodiments, the target sequence of interest encodes an AAV capsid (e.g., a chimeric AAV capsid).

In some aspects, the disclosure provides barcoded chimeric AAV capsid libraries. DNA shuffling may be used to create a library of nucleic acid sequences encoding chimeric AAV capsid proteins. In some embodiments, each nucleic acid sequence of the library comprises nucleic acid fragments from the capsid genes of multiple AAV serotypes. Thus, new AAV variants having altered tissue tropism may be created. In some embodiments, AAV capsid genes having altered tissue tropism are identified using a barcoded chimeric AAV library. In some embodiments, the altered tissue tropism allows for targeting of a particular tissue by the AAV capsid protein. In some embodiments, the target tissue is a tissue selected from the group consisting of heart, lung, spleen, pancreas, skeletal muscle, kidney, intestine, and stomach. In some embodiments, the tissue targeted by the AAV capsid protein is the central nervous system (CNS). In some embodiments, the target tissue is the brain. In some embodiments, the target tissue is the liver.

In some embodiments, the barcoded chimeric AAV capsid library comprises a plurality of nucleic acids, wherein each nucleic acid (i) encodes a unique chimeric AAV capsid protein having distinct polypeptide regions of at least three different AAV serotypes; and/or (ii) comprises a unique barcode sequence. In some embodiments, the barcoded chimeric AAV capsid library comprises a plurality of nucleic acids. In some embodiments, each nucleic acid (i) encodes a unique chimeric AAV capsid protein; and/or (ii) comprises a unique barcode sequence. In some embodiments, barcodes sequences in the library have the formula (NNM)n. In some embodiments, n is an integer in the range of is 5 to 20, 5 to 10, 10 to 20, 7 to 20, or 7 to 30. In some embodiments, n is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more. In some embodiments, N are each nucleotides that are independently selected from A, G, T and C and M is a nucleotide that is either A or C.

In some embodiments, the barcoded chimeric AAV capsid library comprises a plurality of nucleic acids, wherein each nucleic acid (i) encodes a unique AAV capsid protein having distinct polypeptide regions of greater than six amino acids in length that are derived from at least two different AAV serotypes; and (ii) comprises a unique barcode sequence. In some embodiments, the nucleic acid sequences encoding the chimeric AAV capsid proteins comprise capsid sequence fragments from at least 2 different AAV serotypes.

In some embodiments, the barcode sequence comprises a single contiguous stretch of nucleotides inserted into the 3′ untranslated region (UTR) of the nucleic acid sequence encoding the chimeric AAV capsid protein, thereby allowing identification of the capsid protein by detection of the barcode sequence.

In some embodiments, the barcode sequence of the barcoded chimeric AAV capsid library is not interrupted by a primer sequence. In some embodiments, each barcode sequence of the barcoded chimeric AAV capsid library does not comprise a restriction endonuclease cleavage site.

In some embodiments, each unique barcode sequence has a length of between about 10 nucleotides and about 50 nucleotides. In some embodiments, each unique barcode sequence has a length of between about 20 and about 40 nucleotides. In some embodiments, each unique barcode sequence has a length of 30 nucleotides.

Construction of the chimeric AAV capsid libraries of the disclosure maybe carried out by any suitable means known in the art, for example DNA family shuffling (Crameri et al., Nature, 391: 288-291 (1998), the contents of which are incorporated herein by reference in their entirety). Methods of incorporating nucleic acid barcodes into libraries are also known in the art, for example through the addition of adapter sequences.

In another aspect, the disclosure relates to methods of producing a chimeric AAV capsid encoding nucleic acid library. In some embodiments, the method of producing a chimeric AAV capsid encoding nucleic acid library comprises (i) preparing a set of chimeric AAV capsid encoding nucleic acids; (ii) preparing a set of constrained, randomized barcode nucleic acids such that number of unique barcode nucleic acids is greater than the number of chimeric AAV capsid encoding nucleic acids; and (iii) combining each chimeric AAV capsid encoding nucleic acid with a unique barcode nucleic acid of the set prepared in (ii).

In some embodiments, methods of generating a chimeric AAV capsid encoding nucleic acid library are provided that comprise shuffling AAV capsid encoding nucleic acids to obtain a set of chimeric AAV capsid encoding nucleic acids. In some embodiments, the methods further comprise tagging each chimeric AAV capsid encoding nucleic acid with a unique barcode nucleic acid from a set of constrained, randomized barcode nucleic acids. In some embodiments, each nucleic acid of a set encodes a unique chimeric AAV capsid protein having distinct polypeptide regions of at least three different AAV serotypes.

In some embodiments, barcode nucleic acids in a library produced by methods disclosed herein have the formula: (NNM)_(n). In some embodiments, n is an integer in the range of is 5 to 20, 5 to 10, 10 to 20, 7 to 20, or 7 to 30. In some embodiments, n is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more. In some embodiments, N are each nucleotides that are independently selected from A, G, T and C and M is a nucleotide that is either A or C. In some embodiments, the chimeric AAV capsid encoding nucleic acids each encode a unique chimeric AAV capsid protein having distinct polypeptide regions of at least three different AAV serotypes. In some embodiments, the chimeric AAV capsid encoding nucleic acids each encode a unique chimeric AAV capsid protein having distinct polypeptide regions of greater than six amino acids in length that are derived from at least two different AAV serotypes. In some embodiments, each barcode nucleic acid comprises a single contiguous stretch of nucleotides inserted into the 3′ untranslated region (UTR) of the nucleic acid sequence encoding the chimeric AAV capsid protein, thereby allowing identification of the capsid protein by detection of the barcode sequence.

In some embodiments, the disclosure relates to a method of preparing an AAV library, in which the method comprises (i) transfecting AAV packaging cells with the chimeric AAV capsid encoding nucleic acid library disclosed herein; and/or (ii) maintaining the cells under conditions that permit production of AAVs incorporating chimeric AAV capsids; and/or (iii) preparing an AAV library comprising AAVs produced in step (ii).

In some embodiments, the disclosure relates to methods of identifying AAV capsids tropic for a target tissue, in which the methods comprise: (i) administering to a subject AAVs of an AAV library produced by a method disclosed herein; and/or (ii) isolating nucleic acids from cells of the target tissue; and/or (iii) detecting AAV capsid encoding nucleic acids in the isolated nucleic acids. In some embodiments, methods of identifying AAV capsids tropic for a target tissue, further comprise amplifying nucleic acids isolated from a target tissue to facilitate detection of the AAV capsid encoding nucleic acids. In some embodiments, methods further comprise confirming that the AAV capsid encoding nucleic acids are from the library by detecting barcode sequences. In some embodiments, the barcode sequences are detected by colony hybridization. In some embodiments, barcode sequences are detected by deep sequencing. In some embodiments, barcode sequences are detected by PCR. In some embodiments, PCR amplifies only the barcode sequence. In some embodiments, the PCR amplifies the entire capsid sequence and the barcode sequence.

Isolated AAV Capsid Proteins and Nucleic Acids Encoding the Same

AAVs isolated from mammals, particularly non-human primates, are useful for creating gene transfer vectors for clinical development and human gene therapy applications. In some embodiments, recombinant AAVs (rAAVs) disclosed herein have a tropism for the CNS. Accordingly, in some embodiments, the rAAV vectors of the instant disclosure are particularly useful for gene therapy of CNS disorders. In some embodiments, AAV capsid proteins are provided that have been obtained using barcoded chimeric AAV capsid libraries. Protein and amino acid sequences as well as other information regarding the AAVs capsid are set forth in the sequence listing. In some embodiments, a fragment (portion) of an isolated nucleic acid encoding a AAV capsid sequence may be useful for constructing a nucleic acid encoding a desired capsid sequence. Fragments may be of any appropriate length (e.g., at least 6, at least 9, at least 18, at least 36, at least 72, at least 144, at least 288, at least 576, at least 1152 or more nucleotides in length). For example, a fragment of nucleic acid sequence encoding a polypeptide of a first AAV capsid protein may be used to construct, or may be incorporated within, a nucleic acid sequence encoding a second AAV capsid sequence to alter the properties of the AAV capsid. In some embodiments, AAV capsid proteins that comprise capsid sequence fragments from multiple AAV serotypes are referred to as chimeric AAV capsids.

In some embodiments, the disclosure provides chimeric capsid proteins comprising distinct polypeptide regions derived from at least 2, at least 3, at least 4, at least 5, at least 6, or more different AAV serotypes. In some embodiments, AAV serotypes are selected from the group consisting of: AAV1, AAV2, AAV4, AAV5, AAV6, AAV8, AAV9, AAVrh8, AAVrh10, AAVrh39, and AAVrh43. In some embodiments, the chimeric capsid proteins comprise contributions from the native capsid serotypes listed above. In some embodiments, the capsid contributors are AAV1, AAVrh.8, AAVrh.10, AAV6, AAVrh.43, AAV2, and AAVrh.39. In some embodiments, the capsid contributors are AAVrh.39, AAVrh.43, AAV1, AAV6, AAV8, AAVrh.10, and AAVrh.8. In some embodiments, the capsid contributors are AAV8, AAVrh.39, AAV1, AAVrh.8, AAV6, and AAVrh.43. In some embodiments, the capsid contributors to B4 are AAVrh.39, AAV1, AAVrh.43, AAVrh.8, AAV6, and AAV8. In some embodiments, the chimeric capsid proteins comprise distinct polypeptide regions of greater than 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 50, 75, 100 or more amino acids in length that are derived from different AAV serotypes. In some embodiments, the AAV serotypes are selected from the group consisting of AAV1, AAV2, AAV4, AAV5, AAV6, AAV8, AAV9, AAVrh8, AAVrh10, AAVrh39, and AAVrh43.

In some embodiments, the disclosure provides for chimeric AAV capsid proteins that are identified by the methods disclosed herein. In some embodiments, the amino acid sequences of the identified chimeric AAV capsid proteins are represented by SEQ ID NO: 5-8 or SEQ ID NO: 13-16. In some embodiments, the chimeric AAV capsid proteins are encoded by nucleic acids represented by SEQ ID NO: 1-4 or SEQ ID NO: 9-12.

The skilled artisan will also realize that conservative amino acid substitutions may be made to provide functionally equivalent variants, or homologs of the capsid proteins. In some aspects the disclosure embraces sequence alterations that result in conservative amino acid substitutions. As used herein, a conservative amino acid substitution refers to an amino acid substitution that does not alter the relative charge or size characteristics of the protein in which the amino acid substitution is made. Variants can be prepared according to methods for altering polypeptide sequence known to one of ordinary skill in the art such as are found in references that compile such methods, e.g., Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989, or Current Protocols in Molecular Biology, F. M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. Conservative substitutions of amino acids include substitutions made among amino acids within the following groups: (a) M, I, L, V; (b) F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; (f) Q, N; and (g) E, D. Therefore, one can make conservative amino acid substitutions to the amino acid sequence of the proteins and polypeptides disclosed herein.

As used herein, the term “nucleic acid” refers to polymers of linked nucleotides, such as DNA, RNA, etc. In some embodiments, proteins and nucleic acids of the disclosure are isolated. As used herein, the term “isolated” means artificially produced. As used herein with respect to nucleic acids, the term “isolated” means: (i) amplified in vitro by, for example, polymerase chain reaction (PCR); (ii) recombinantly produced by cloning; (iii) purified, as by cleavage and gel separation; or (iv) synthesized by, for example, chemical synthesis. An isolated nucleic acid is one which is readily manipulable by recombinant DNA techniques well known in the art. Thus, a nucleotide sequence contained in a vector in which 5′ and 3′ restriction sites are known or for which polymerase chain reaction (PCR) primer sequences have been disclosed is considered isolated but a nucleic acid sequence existing in its native state in its natural host is not. An isolated nucleic acid may be substantially purified, but need not be. For example, a nucleic acid that is isolated within a cloning or expression vector is not pure in that it may comprise only a tiny percentage of the material in the cell in which it resides. Such a nucleic acid is isolated, however, as the term is used herein because it is readily manipulable by standard techniques known to those of ordinary skill in the art. As used herein with respect to proteins or peptides, the term “isolated” refers to a protein or peptide that has been artificially produced (e.g., by chemical synthesis, by recombinant DNA technology, etc.).

“Homology” refers to the percent identity between two polynucleotides or two polypeptides. The term “substantial homology” when referring to a nucleic acid, or fragment thereof, indicates that, when optimally aligned with appropriate nucleotide insertions or deletions with another nucleic acid (or its complementary strand), there is nucleotide sequence identity in about 90 to 100% of the aligned sequences. When referring to a polypeptide, or fragment thereof, the term “substantial homology” indicates that, when optimally aligned with appropriate gaps, insertions or deletions with another polypeptide, there is nucleotide sequence identity in about 90 to 100% of the aligned sequences. The term “highly conserved” means at least 80% identity, preferably at least 90% identity, and more preferably, over 97% identity. In some cases, highly conserved may refer to 100% identity. Identity is readily determined by one of skill in the art by, for example, the use of algorithms and computer programs known by those of skill in the art.

As described herein, alignments between sequences of nucleic acids or polypeptides are performed using any of a variety of publicly or commercially available Multiple Sequence Alignment Programs, such as “Clustal W”, accessible through Web Servers on the internet. Alternatively, Vector NTI utilities may also be used. There are also a number of algorithms known in the art which can be used to measure nucleotide sequence identity, including those contained in the programs described above. As another example, polynucleotide sequences can be compared using BLASTN, which provides alignments and percent sequence identity of the regions of the best overlap between the query and search sequences. Similar programs are available for the comparison of amino acid sequences, e.g., the “Clustal X” program, BLASTP. Typically, any of these programs are used at default settings, although one of skill in the art can alter these settings as needed. Alternatively, one of skill in the art can utilize another algorithm or computer program which provides at least the level of identity or alignment as that provided by the referenced algorithms and programs. Alignments may be used to identify corresponding amino acids between two proteins or peptides. A “corresponding amino acid” is an amino acid of a protein or peptide sequence that has been aligned with an amino acid of another protein or peptide sequence. Corresponding amino acids may be identical or non-identical. A corresponding amino acid that is a non-identical amino acid may be referred to as a variant amino acid.

Alternatively for nucleic acids, homology can be determined by hybridization of polynucleotides under conditions which form stable duplexes between homologous regions, followed by digestion with single-stranded-specific nuclease(s), and size determination of the digested fragments. DNA sequences that are substantially homologous can be identified in a Southern hybridization experiment under, for example, stringent conditions, as defined for that particular system. Defining appropriate hybridization conditions is within the skill of the art.

Recombinant AAVs

In some aspects, the disclosure provides isolated AAVs. As used herein with respect to AAVs, the term “isolated” refers to an AAV that has been artificially obtained or produced. Isolated AAVs may be produced using recombinant methods. Such AAVs are referred to herein as “recombinant AAVs”. Recombinant AAVs (rAAVs) preferably have tissue-specific targeting capabilities, such that a transgene of the rAAV will be delivered specifically to one or more predetermined tissue(s). The AAV capsid is an important element in determining these tissue-specific targeting capabilities. Thus, an rAAV having a capsid appropriate for the tissue being targeted can be selected. In some embodiments, the rAAV comprises a capsid protein having an amino acid sequence as set forth in any one of SEQ ID NOs 5-8 or SEQ ID NO: 13-16, or a protein having substantial homology thereto.

In some embodiments, the rAAVs of the disclosure are pseudotyped rAAVs. Pseudotyping is the process of producing viruses or viral vectors in combination with foreign viral envelope proteins. The result is a pseudotyped virus particle. With this method, the foreign viral envelope proteins can be used to alter host tropism or an increased/decreased stability of the virus particles. In some aspects, a pseudotyped rAAV comprises nucleic acids from two or more different AAVs, wherein the nucleic acid from one AAV encodes a capsid protein and the nucleic acid of at least one other AAV encodes other viral proteins and/or the viral genome. In some embodiments, a pseudotyped rAAV refers to an AAV comprising an inverted terminal repeats (ITRs) of one AAV serotype and an capsid protein of a different AAV serotype. For example, a pseudotyped AAV vector containing the ITRs of serotype X encapsidated with the proteins of Y will be designated as AAVX/Y (e.g., AAV2/1 has the ITRs of AAV2 and the capsid of AAV1). In some embodiments, pseudotyped rAAVs may be useful for combining the tissue-specific targeting capabilities of a capsid protein from one AAV serotype with the viral DNA from another AAV serotype, thereby allowing targeted delivery of a transgene to a target tissue.

Methods for obtaining recombinant AAVs having a desired capsid protein are well known in the art. (See, for example, US 2003/0138772), the contents of which are incorporated herein by reference in their entirety). Typically the methods involve culturing a host cell which contains a nucleic acid sequence encoding an AAV capsid protein (e.g., a nucleic acid having a sequence as set forth in any one of SEQ ID NOs: 1-4 or SEQ ID NO: 9-12, or fragment thereof; a functional rep gene; a recombinant AAV vector composed of, AAV inverted terminal repeats (ITRs) and a transgene; and sufficient helper functions to permit packaging of the recombinant AAV vector into the AAV capsid proteins.

In some embodiments, the disclosure relates to a rAAV comprising a chimeric capsid protein identified by the methods disclosed herein. In some embodiments, the rAAV targets the CNS tissue of a subject. In some embodiments, the rAAV further comprises a transgene. In some embodiments, the transgene is a CNS-associated gene. In some embodiments, the rAAV targets the liver tissue of a subject. In some embodiments, the transgene is a Liver- or CNS-associated miRNA.

The components to be cultured in the host cell to package a rAAV vector in an AAV capsid may be provided to the host cell in trans. Alternatively, any one or more of the required components (e.g., recombinant AAV vector, rep sequences, cap sequences, and/or helper functions) may be provided by a stable host cell which has been engineered to contain one or more of the required components using methods known to those of skill in the art. Most suitably, such a stable host cell will contain the required component(s) under the control of an inducible promoter. However, the required component(s) may be under the control of a constitutive promoter. Examples of suitable inducible and constitutive promoters are provided herein, in the discussion of regulatory elements suitable for use with the transgene. In still another alternative, a selected stable host cell may contain selected component(s) under the control of a constitutive promoter and other selected component(s) under the control of one or more inducible promoters. For example, a stable host cell may be generated which is derived from 293 cells (which contain El helper functions under the control of a constitutive promoter), but which contain the rep and/or cap proteins under the control of inducible promoters. Still other stable host cells may be generated by one of skill in the art.

The recombinant AAV vector, rep sequences, cap sequences, and helper functions required for producing the rAAV of the disclosure may be delivered to the packaging host cell using any appropriate genetic element (vector). The selected genetic element may be delivered by any suitable method, including those described herein. The methods used to construct any embodiment of this disclosure are known to those with skill in nucleic acid manipulation and include genetic engineering, recombinant engineering, and synthetic techniques. See, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, N.Y. Similarly, methods of generating rAAV virions are well known and the selection of a suitable method is not a limitation on the disclosure. See, e.g., K. Fisher et al, J. Virol., 70:520-532 (1993) and U.S. Pat. No. 5,478,745.

In some embodiments, recombinant AAVs may be produced using the triple transfection method (described in detail in U.S. Pat. No. 6,001,650). Typically, the recombinant AAVs are produced by transfecting a host cell with an recombinant AAV vector (comprising a transgene) to be packaged into AAV particles, an AAV helper function vector, and an accessory function vector. An AAV helper function vector encodes the “AAV helper function” sequences (e.g., rep and cap), which function in trans for productive AAV replication and encapsidation. Preferably, the AAV helper function vector supports efficient AAV vector production without generating any detectable wild-type AAV virions (e.g., AAV virions containing functional rep and cap genes). Non-limiting examples of vectors suitable for use with the disclosure include pHLP19, described in U.S. Pat. No. 6,001,650 and pRep6cap6 vector, described in U.S. Pat. No. 6,156,303, the entirety of both incorporated by reference herein. The accessory function vector encodes nucleotide sequences for non-AAV derived viral and/or cellular functions upon which AAV is dependent for replication (e.g., “accessory functions”). The accessory functions include those functions required for AAV replication, including, without limitation, those moieties involved in activation of AAV gene transcription, stage specific AAV mRNA splicing, AAV DNA replication, synthesis of cap expression products, and AAV capsid assembly. Viral-based accessory functions can be derived from any of the known helper viruses such as adenovirus, herpesvirus (other than herpes simplex virus type-1), and vaccinia virus.

In some aspects, the disclosure provides transfected host cells. The term “transfection” is used to refer to the uptake of foreign DNA by a cell, and a cell has been “transfected” when exogenous DNA has been introduced inside the cell membrane. A number of transfection techniques are generally known in the art. See, e.g., Graham et al. (1973) Virology, 52:456, Sambrook et al. (1989) Molecular Cloning, a laboratory manual, Cold Spring Harbor Laboratories, New York, Davis et al. (1986) Basic Methods in Molecular Biology, Elsevier, and Chu et al. (1981) Gene 13:197. Such techniques can be used to introduce one or more exogenous nucleic acids, such as a nucleotide integration vector and other nucleic acid molecules, into suitable host cells.

A “host cell” refers to any cell that harbors, or is capable of harboring, a substance of interest. Often a host cell is a mammalian cell. A host cell may be used as a recipient of an AAV helper construct, an AAV minigene plasmid, an accessory function vector, or other transfer DNA associated with the production of recombinant AAVs. The term includes the progeny of the original cell which has been transfected. Thus, a “host cell” as used herein may refer to a cell which has been transfected with an exogenous DNA sequence. It is understood that the progeny of a single parental cell may not necessarily be completely identical in morphology or in genomic or total DNA complement as the original parent, due to natural, accidental, or deliberate mutation.

As used herein, the term “cell line” refers to a population of cells capable of continuous or prolonged growth and division in vitro. Often, cell lines are clonal populations derived from a single progenitor cell. It is further known in the art that spontaneous or induced changes can occur in karyotype during storage or transfer of such clonal populations. Therefore, cells derived from the cell line referred to may not be precisely identical to the ancestral cells or cultures, and the cell line referred to includes such variants.

As used herein, the terms “recombinant cell” refers to a cell into which an exogenous DNA segment, such as DNA segment that leads to the transcription of a biologically-active polypeptide or production of a biologically active nucleic acid such as an RNA, has been introduced.

As used herein, the term “vector” includes any genetic element, such as a plasmid, phage, transposon, cosmid, chromosome, artificial chromosome, virus, virion, etc., which is capable of replication when associated with the proper control elements and which can transfer gene sequences between cells. Thus, the term includes cloning and expression vehicles, as well as viral vectors. In some embodiments, useful vectors are contemplated to be those vectors in which the nucleic acid segment to be transcribed is positioned under the transcriptional control of a promoter. A “promoter” refers to a DNA sequence recognized by the synthetic machinery of the cell, or introduced synthetic machinery, required to initiate the specific transcription of a gene. The phrases “operatively positioned,” “under control” or “under transcriptional control” means that the promoter is in the correct location and orientation in relation to the nucleic acid to control RNA polymerase initiation and expression of the gene. The term “expression vector or construct” means any type of genetic construct containing a nucleic acid in which part or all of the nucleic acid encoding sequence is capable of being transcribed. In some embodiments, expression includes transcription of the nucleic acid, for example, to generate a biologically-active polypeptide product or inhibitory RNA (e.g., shRNA, miRNA, miRNA inhibitor) from a transcribed gene.

The foregoing methods for packaging recombinant vectors in desired AAV capsids to produce the rAAVs of the disclosure are not meant to be limiting and other suitable methods will be apparent to the skilled artisan.

Recombinant AAV Vectors

“Recombinant AAV (rAAV) vectors” of the disclosure are typically composed of, at a minimum, a transgene and its regulatory sequences, and 5′ and 3′ AAV inverted terminal repeats (ITRs). It is this recombinant AAV vector which is packaged into a capsid protein and delivered to a selected target cell. In some embodiments, the transgene is a nucleic acid sequence, heterologous to the vector sequences, which encodes a polypeptide, protein, functional RNA molecule (e.g., miRNA, miRNA inhibitor) or other gene product, of interest. The nucleic acid coding sequence is operatively linked to regulatory components in a manner which permits transgene transcription, translation, and/or expression in a cell of a target tissue.

The AAV sequences of the vector typically comprise the cis-acting 5′ and 3′ inverted terminal repeat sequences (See, e.g., B. J. Carter, in “Handbook of Parvoviruses”, ed., P. Tijsser, CRC Press, pp. 155 168 (1990)). The ITR sequences are about 145 bp in length. Preferably, substantially the entire sequences encoding the ITRs are used in the molecule, although some degree of minor modification of these sequences is permissible. The ability to modify these ITR sequences is within the skill of the art. (See, e.g., texts such as Sambrook et al, “Molecular Cloning. A Laboratory Manual”, 2d ed., Cold Spring Harbor Laboratory, New York (1989); and K. Fisher et al., J Virol., 70:520 532 (1996)). An example of such a molecule employed in the disclosure is a “cis-acting” plasmid containing the transgene, in which the selected transgene sequence and associated regulatory elements are flanked by the 5′ and 3′ AAV ITR sequences. The AAV ITR sequences may be obtained from any known AAV, including presently identified mammalian AAV types.

In some embodiments, the rAAVs of the disclosure are pseudotyped rAAVs. For example, a pseudotyped AAV vector containing the ITRs of serotype X encapsidated with the proteins of Y will be designated as AAVX/Y (e.g., AAV2/1 has the ITRs of AAV2 and the capsid of AAV1). In some embodiments, pseudotyped rAAVs may be useful for combining the tissue-specific targeting capabilities of a capsid protein from one AAV serotype with the viral DNA from another AAV serotype, thereby allowing targeted delivery of a transgene to a target tissue.

In addition to the major elements identified above for the recombinant AAV vector, the vector also includes conventional control elements necessary which are operably linked to the transgene in a manner which permits its transcription, translation and/or expression in a cell transfected with the plasmid vector or infected with the virus produced by the disclosure. As used herein, “operably linked” sequences include both expression control sequences that are contiguous with the gene of interest and expression control sequences that act in trans or at a distance to control the gene of interest.

Expression control sequences include appropriate transcription initiation, termination, promoter and enhancer sequences; efficient RNA processing signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (e.g., Kozak consensus sequence); sequences that enhance protein stability; and when desired, sequences that enhance secretion of the encoded product. A great number of expression control sequences, including promoters which are native, constitutive, inducible and/or tissue-specific, are known in the art and may be utilized.

As used herein, a nucleic acid sequence (e.g., coding sequence) and regulatory sequences are said to be “operably” linked when they are covalently linked in such a way as to place the expression or transcription of the nucleic acid sequence under the influence or control of the regulatory sequences. If it is desired that the nucleic acid sequences be translated into a functional protein, two DNA sequences are said to be operably linked if induction of a promoter in the 5′ regulatory sequences results in the transcription of the coding sequence and if the nature of the linkage between the two DNA sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the ability of the promoter region to direct the transcription of the coding sequences, or (3) interfere with the ability of the corresponding RNA transcript to be translated into a protein. Thus, a promoter region would be operably linked to a nucleic acid sequence if the promoter region were capable of effecting transcription of that DNA sequence such that the resulting transcript might be translated into the desired protein or polypeptide. Similarly two or more coding regions are operably linked when they are linked in such a way that their transcription from a common promoter results in the expression of two or more proteins having been translated in frame. In some embodiments, operably linked coding sequences yield a fusion protein. In some embodiments, operably linked coding sequences yield a functional RNA (e.g., shRNA, miRNA, miRNA inhibitor).

For nucleic acids encoding proteins, a polyadenylation sequence generally is inserted following the transgene sequences and before the 3′ AAV ITR sequence. A rAAV construct useful in the disclosure may also contain an intron, desirably located between the promoter/enhancer sequence and the transgene. One possible intron sequence is derived from SV-40, and is referred to as the SV-40 T intron sequence. Another vector element that may be used is an internal ribosome entry site (IRES). An IRES sequence is used to produce more than one polypeptide from a single gene transcript. An IRES sequence would be used to produce a protein that contain more than one polypeptide chains. Selection of these and other common vector elements are conventional and many such sequences are available [see, e.g., Sambrook et al, and references cited therein at, for example, pages 3.18 3.26 and 16.17 16.27 and Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1989]. In some embodiments, a Foot and Mouth Disease Virus 2A sequence is included in polyprotein; this is a small peptide (approximately 18 amino acids in length) that has been shown to mediate the cleavage of polyproteins (Ryan, M D et al., EMBO, 1994; 4: 928-933; Mattion, N M et al., J Virology, November 1996; p. 8124-8127; Furler, S et al., Gene Therapy, 2001; 8: 864-873; and Halpin, C et al., The Plant Journal, 1999; 4: 453-459). The cleavage activity of the 2A sequence has previously been demonstrated in artificial systems including plasmids and gene therapy vectors (AAV and retroviruses) (Ryan, M D et al., EMBO, 1994; 4: 928-933; Mattion, N M et al., J Virology, November 1996; p. 8124-8127; Furler, S et al., Gene Therapy, 2001; 8: 864-873; and Halpin, C et al., The Plant Journal, 1999; 4: 453-459; de Felipe, P et al., Gene Therapy, 1999; 6: 198-208; de Felipe, Petal., Human Gene Therapy, 2000; 11: 1921-1931.; and Klump, H et al., Gene Therapy, 2001; 8: 811-817).

The precise nature of the regulatory sequences needed for gene expression in host cells may vary between species, tissues or cell types, but shall in general include, as necessary, 5′ non-transcribed and 5′ non-translated sequences involved with the initiation of transcription and translation respectively, such as a TATA box, capping sequence, CAAT sequence, enhancer elements, and the like. Especially, such 5′ non-transcribed regulatory sequences will include a promoter region that includes a promoter sequence for transcriptional control of the operably joined gene. Regulatory sequences may also include enhancer sequences or upstream activator sequences as desired. The vectors of the disclosure may optionally include 5′ leader or signal sequences. The choice and design of an appropriate vector is within the ability and discretion of one of ordinary skill in the art.

Examples of constitutive promoters include, without limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) [see, e.g., Boshart et al, Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate reductase promoter, the β-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF1α promoter [Invitrogen].

Inducible promoters allow regulation of gene expression and can be regulated by exogenously supplied compounds, environmental factors such as temperature, or the presence of a specific physiological state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells only. Inducible promoters and inducible systems are available from a variety of commercial sources, including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have been described and can be readily selected by one of skill in the art. Examples of inducible promoters regulated by exogenously supplied promoters include the zinc-inducible sheep metallothionine (MT) promoter, the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter, the T7 polymerase promoter system (WO 98/10088); the ecdysone insect promoter (No et al, Proc. Natl. Acad. Sci. USA, 93:3346-3351 (1996)), the tetracycline-repressible system (Gossen et al, Proc. Natl. Acad. Sci. USA, 89:5547-5551 (1992)), the tetracycline-inducible system (Gossen et al, Science, 268:1766-1769 (1995), see also Harvey et al, Curr. Opin. Chem. Biol., 2:512-518 (1998)), the RU486-inducible system (Wang et al, Nat. Biotech., 15:239-243 (1997) and Wang et al, Gene Ther., 4:432-441 (1997)) and the rapamycin-inducible system (Magari et al, J. Clin. Invest., 100:2865-2872 (1997)). Still other types of inducible promoters which may be useful in this context are those which are regulated by a specific physiological state, e.g., temperature, acute phase, a particular differentiation state of the cell, or in replicating cells only.

In another embodiment, the native promoter for the transgene will be used. The native promoter may be preferred when it is desired that expression of the transgene should mimic the native expression. The native promoter may be used when expression of the transgene must be regulated temporally or developmentally, or in a tissue-specific manner, or in response to specific transcriptional stimuli. In a further embodiment, other native expression control elements, such as enhancer elements, polyadenylation sites or Kozak consensus sequences may also be used to mimic the native expression.

In some embodiments, the regulatory sequences impart tissue-specific gene expression capabilities. In some cases, the tissue-specific regulatory sequences bind tissue-specific transcription factors that induce transcription in a tissue specific manner. Such tissue-specific regulatory sequences (e.g., promoters, enhancers, etc.) are well known in the art. Exemplary tissue-specific regulatory sequences include, but are not limited to the following tissue specific promoters: a liver-specific thyroxin binding globulin (TBG) promoter, an insulin promoter, a glucagon promoter, a somatostatin promoter, a pancreatic polypeptide (PPY) promoter, a synapsin-1 (Syn) promoter, a creatine kinase (MCK) promoter, a mammalian desmin (DES) promoter, a α-myosin heavy chain (a-MHC) promoter, or a cardiac Troponin T (cTnT) promoter. Other exemplary promoters include Beta-actin promoter, hepatitis B virus core promoter, Sandig et al., Gene Ther., 3:1002-9 (1996); alpha-fetoprotein (AFP) promoter, Arbuthnot et al., Hum. Gene Ther., 7:1503-14 (1996)), bone osteocalcin promoter (Stein et al., Mol. Biol. Rep., 24:185-96 (1997)); bone sialoprotein promoter (Chen et al., J. Bone Miner. Res., 11:654-64 (1996)), CD2 promoter (Hansal et al., J. Immunol., 161:1063-8 (1998); immunoglobulin heavy chain promoter; T cell receptor α-chain promoter, neuronal such as neuron-specific enolase (NSE) promoter (Andersen et al., Cell. Mol. Neurobiol., 13:503-15 (1993)), neurofilament light-chain gene promoter (Piccioli et al., Proc. Natl. Acad. Sci. USA, 88:5611-5 (1991)), and the neuron-specific vgf gene promoter (Piccioli et al., Neuron, 15:373-84 (1995)), among others which will be apparent to the skilled artisan.

In some embodiments, one or more binding sites for one or more of miRNAs are incorporated in a transgene of a rAAV vector, to inhibit the expression of the transgene in one or more tissues of a subject harboring the transgene. The skilled artisan will appreciate that binding sites may be selected to control the expression of a transgene in a tissue specific manner. For example, binding sites for the liver-specific miR-122 may be incorporated into a transgene to inhibit expression of that transgene in the liver. The target sites in the mRNA may be in the 5′ UTR, the 3′ UTR or in the coding region. Typically, the target site is in the 3′ UTR of the mRNA. Furthermore, the transgene may be designed such that multiple miRNAs regulate the mRNA by recognizing the same or multiple sites. The presence of multiple miRNA binding sites may result in the cooperative action of multiple RISCs and provide highly efficient inhibition of expression. The target site sequence may comprise a total of 5-100, 10-60, or more nucleotides. The target site sequence may comprise at least 5 nucleotides of the sequence of a target gene binding site.

Recombinant AAV Vector: Transgene Coding Sequences

The composition of the transgene sequence of the rAAV vector will depend upon the use to which the resulting vector will be put. For example, one type of transgene sequence includes a reporter sequence, which upon expression produces a detectable signal. In another example, the transgene encodes a therapeutic protein or therapeutic functional RNA. In another example, the transgene encodes a protein or functional RNA that is intended to be used for research purposes, e.g., to create a somatic transgenic animal model harboring the transgene, e.g., to study the function of the transgene product. In another example, the transgene encodes a protein or functional RNA that is intended to be used to create an animal model of disease. Appropriate transgene coding sequences will be apparent to the skilled artisan.

Reporter sequences that may be provided in a transgene include, without limitation, DNA sequences encoding β-lactamase, β-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein (GFP), chloramphenicol acetyltransferase (CAT), luciferase, and others well known in the art. When associated with regulatory elements which drive their expression, the reporter sequences, provide signals detectable by conventional means, including enzymatic, radiographic, colorimetric, fluorescence or other spectrographic assays, fluorescent activating cell sorting assays and immunological assays, including enzyme linked immunosorbent assay (ELISA), radioimmunoassay (RIA) and immunohistochemistry. For example, where the marker sequence is the LacZ gene, the presence of the vector carrying the signal is detected by assays for β-galactosidase activity. Where the transgene is green fluorescent protein or luciferase, the vector carrying the signal may be measured visually by color or light production in a luminometer. Such reporters can, for example, be useful in verifying the tissue-specific targeting capabilities and tissue specific promoter regulatory activity of an rAAV.

In some aspects, the disclosure provides rAAV vectors for use in methods of preventing or treating one or more genetic deficiencies or dysfunctions in a mammal, such as for example, a polypeptide deficiency or polypeptide excess in a mammal, and particularly for treating or reducing the severity or extent of deficiency in a human manifesting one or more of the disorders linked to a deficiency in such polypeptides in cells and tissues. The method involves administration of an rAAV vector that encodes one or more therapeutic peptides, polypeptides, siRNAs, microRNAs, antisense nucleotides, etc. in a pharmaceutically-acceptable carrier to the subject in an amount and for a period of time sufficient to treat the deficiency or disorder in the subject suffering from such a disorder.

Thus, the disclosure embraces the delivery of rAAV vectors encoding one or more peptides, polypeptides, or proteins, which are useful for the treatment or prevention of disease states in a mammalian subject. Exemplary therapeutic proteins include one or more polypeptides selected from the group consisting of growth factors, interleukins, interferons, anti-apoptosis factors, cytokines, anti-diabetic factors, anti-apoptosis agents, coagulation factors, anti-tumor factors. Other non-limiting examples of therapeutic proteins include BDNF, CNTF, CSF, EGF, FGF, G-SCF, GM-CSF, gonadotropin, IFN, IFG-1, M-CSF, NGF, PDGF, PEDF, TGF, VEGF, TGF-B2, TNF, prolactin, somatotropin, XIAP1, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-10(187A), viral IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IL-16 IL-17, and IL-18.

The rAAV vectors may comprise a gene to be transferred to a subject to treat a disease associated with reduced expression, lack of expression or dysfunction of the gene. Exemplary genes and associated disease states include, but are not limited to: glucose-6-phosphatase, associated with glycogen storage deficiency type 1A; phosphoenolpyruvate-carboxykinase, associated with Pepck deficiency; galactose-1 phosphate uridyl transferase, associated with galactosemia; phenylalanine hydroxylase, associated with phenylketonuria; branched chain alpha-ketoacid dehydrogenase, associated with Maple syrup urine disease; fumarylacetoacetate hydrolase, associated with tyrosinemia type 1; methylmalonyl-CoA mutase, associated with methylmalonic acidemia; medium chain acyl CoA dehydrogenase, associated with medium chain acetyl CoA deficiency; omithine transcarbamylase, associated with omithine transcarbamylase deficiency; argininosuccinic acid synthetase, associated with citrullinemia; low density lipoprotein receptor protein, associated with familial hypercholesterolemia; UDP-glucouronosyltransferase, associated with Crigler-Najjar disease; adenosine deaminase, associated with severe combined immunodeficiency disease; hypoxanthine guanine phosphoribosyl transferase, associated with Gout and Lesch-Nyan syndrome; biotinidase, associated with biotinidase deficiency; beta-galactosidase, associated with GM1 gangliosidosis; beta-hexosaminidase A and B, associated with Tay-Sachs disease and Sandhoff disease; beta-glucocerebrosidase, associated with Gaucher disease; beta-glucuronidase, associated with Sly syndrome; peroxisome membrane protein 70 kDa, associated with Zellweger syndrome; porphobilinogen deaminase, associated with acute intermittent porphyria; alpha-1 antitrypsin for treatment of alpha-1 antitrypsin deficiency (emphysema); erythropoietin for treatment of anemia due to thalassemia or to renal failure; vascular endothelial growth factor, angiopoietin-1, and fibroblast growth factor for the treatment of ischemic diseases; thrombomodulin and tissue factor pathway inhibitor for the treatment of occluded blood vessels as seen in, for example, atherosclerosis, thrombosis, or embolisms; aromatic amino acid decarboxylase (AADC), and tyrosine hydroxylase (TH) for the treatment of Parkinson's disease; the beta adrenergic receptor, anti-sense to, or a mutant form of, phospholamban, the sarco(endo)plasmic reticulum adenosine triphosphatase-2 (SERCA2), and the cardiac adenylyl cyclase for the treatment of congestive heart failure; a tumor suppessor gene such as p53 for the treatment of various cancers; a cytokine such as one of the various interleukins for the treatment of inflammatory and immune disorders and cancers; dystrophin or minidystrophin and utrophin or miniutrophin for the treatment of muscular dystrophies; and, insulin for the treatment of diabetes.

In some embodiments, The rAAV vectors may comprise a gene encoding an antigen-binding protein, such as an immunoglobulin heavy chain or light chain or fragment thereof, e.g., that may be used for therapeutic purposes. In some embodiments, the protein is a single chain Fv fragment or Fv-Fc fragment. Accordingly, in some embodiments, the rAAV can be used to infect cells are of target tissue (e.g., muscle tissue) to engineer cells of the tissue to express an antigen-binding protein, such as an antibody or fragment thereof. In some embodiments, to generate rAAVs that express the antibodies or antigen binding fragments, cDNAs engineered to express such proteins will be subcloned into an appropriate plasmid backbone and packaged into an rAAV.

In some embodiments, the rAAV vectors may comprise a gene or genes encoding genome editing enzymes or related molecules. As used herein, “genome editing” refers to adding, disrupting or changing genomic sequences (e.g., a gene sequence). In some embodiments, genome editing is performed using engineered proteins and related molecules. In some aspects, genome editing comprises the use of engineered nucleases to cleave a target genomic locus. In some embodiments, genome editing further comprises inserting, deleting, mutating or substituting nucleic acid residues at a cleaved locus. In some embodiments, inserting, deleting, mutating or substituting nucleic acid residues at a cleaved locus is accomplished through endogenous cellular mechanisms such as homologous recombination (HR) and non-homologous end joining (NHEJ). Exemplary genome editing technologies include, but are not limited to Transcription Activator-like Effector Nucleases (TALENs), Zinc Finger Nucleases (ZFNs), engineered meganuclease re-engineered homing endonucleases and the CRISPR/Cas system. In some embodiments, the rAAV may comprise a gene or genes encoding proteins or molecules related to TALENs, including but not limited to transcription activator-like effectors (TALEs) and restriction endonucleases (e.g., FokI). In some embodiments, the rAAV may comprise a gene or genes encoding proteins or molecules related to ZFNs, including but not limited to proteins comprising the Cys2His2 fold group (for example Zif268 (EGR1)), and restriction endonucleases (e.g., FokI). In some embodiments, the rAAV may comprise a gene or genes encoding proteins or molecules related to the CRISPR/Cas system, including but not limited to Cas9, Cas6, dCas9, CRISPR RNA (crRNA) and trans-activating crRNA (tracrRNA).

The rAAVs of the disclosure can be used to restore the expression of genes that are reduced in expression, silenced, or otherwise dysfunctional in a subject (e.g., a tumor suppressor that has been silenced in a subject having cancer). The rAAVs of the disclosure can also be used to knockdown the expression of genes that are aberrantly expressed in a subject (e.g., an oncogene that is expressed in a subject having cancer). In some embodiments, an rAAV vector comprising a nucleic acid encoding a gene product associated with cancer (e.g., tumor suppressors) may be used to treat the cancer, by administering a rAAV harboring the rAAV vector to a subject having the cancer. In some embodiments, an rAAV vector comprising a nucleic acid encoding a small interfering nucleic acid (e.g., shRNAs, miRNAs) that inhibits the expression of a gene product associated with cancer (e.g., oncogenes) may be used to treat the cancer, by administering a rAAV harboring the rAAV vector to a subject having the cancer. In some embodiments, an rAAV vector comprising a nucleic acid encoding a gene product associated with cancer (or a functional RNA that inhibits the expression of a gene associated with cancer) may be used for research purposes, e.g., to study the cancer or to identify therapeutics that treat the cancer. The following is a non-limiting list of exemplary genes known to be associated with the development of cancer (e.g., oncogenes and tumor suppressors): AARS, ABCB1, ABCC4, ABI2, ABL1, ABL2, ACK1, ACP2, ACY1, ADSL, AK1, AKR1C2, AKT1, ALB, ANPEP, ANXA5, ANXA7, AP2M1, APC, ARHGAP5, ARHGEF5, ARID4A, ASNS, ATF4, ATM, ATP5B, ATP5O, AXL, BARD1, BAX, BCL2, BHLHB2, BLMH, BRAF, BRCA1, BRCA2, BTK, CANX, CAP1, CAPN1, CAPNS1, CAV1, CBFB, CBLB, CCL2, CCND1, CCND2, CCND3, CCNE1, CCT5, CCYR61, CD24, CD44, CD59, CDCl20, CDCl25, CDCl25A, CDCl25B, CDCl2L5, CDK10, CDK4, CDK5, CDK9, CDKL1, CDKN1A, CDKN1B, CDKN1C, CDKN2A, CDKN2B, CDKN2D, CEBPG, CENPC1, CGRRF1, CHAF1A, CIB1, CKMT1, CLK1, CLK2, CLK3, CLNS1A, CLTC, COL1A1, COL6A3, COX6C, COX7A2, CRAT, CRHR1, CSF1R, CSK, CSNK1G2, CTNNA1, CTNNB1, CTPS, CTSC, CTSD, CUL1, CYR61, DCC, DCN, DDX10, DEK, DHCR7, DHRS2, DHX8, DLG3, DVL1, DVL3, E2F1, E2F3, E2F5, EGFR, EGR1, EIF5, EPHA2, ERBB2, ERBB3, ERBB4, ERCC3, ETV1, ETV3, ETV6, F2R, FASTK, FBN1, FBN2, FES, FGFR1, FGR, FKBP8, FN1, FOS, FOSL1, FOSL2, FOXG1A, FOXO1A, FRAP1, FRZB, FTL, FZD2, FZD5, FZD9, G22P1, GAS6, GCN5L2, GDF15, GNA13, GNAS, GNB2, GNB2L1, GPR39, GRB2, GSK3A, GSPT1, GTF2I, HDAC1, HDGF, HMMR, HPRT1, HRB, HSPA4, HSPA5, HSPA8, HSPB1, HSPH1, HYAL1, HYOU1, ICAM1, ID1, ID2, IDUA, IER3, IFITM1, IGF1R, IGF2R, IGFBP3, IGFBP4, IGFBP5, IL1B, ILK, ING1, IRF3, ITGA3, ITGA6, ITGB4, JAK1, JARID1A, JUN, JUNB, JUND, K-ALPHA-1, KIT, KITLG, KLK10, KPNA2, KRAS2, KRT18, KRT2A, KRT9, LAMB1, LAMP2, LCK, LCN2, LEP, LITAF, LRPAP1, LTF, LYN, LZTR1, MADH1, MAP2K2, MAP3K8, MAPK12, MAPK13, MAPKAPK3, MAPRE1, MARS, MAS1, MCC, MCM2, MCM4, MDM2, MDM4, MET, MGST1, MICB, MLLT3, MME, MMP1, MMP14, MMP17, MMP2, MNDA, MSH2, MSH6, MT3, MYB, MYBL1, MYBL2, MYC, MYCL1, MYCN, MYD88, MYL9, MYLK, NEO1, NF1, NF2, NFKB1, NFKB2, NFSF7, NID, NINJ1, NMBR, NME1, NME2, NME3, NOTCH1, NOTCH2, NOTCH4, NPM1, NQO1, NR1D1, NR2F1, NR2F6, NRAS, NRG1, NSEP1, OSM, PA2G4, PABPC1, PCNA, PCTK1, PCTK2, PCTK3, PDGFA, PDGFB, PDGFRA, PDPK1, PEA15, PFDN4, PFDN5, PGAM1, PHB, PIK3CA, PIK3CB, PIK3CG, PIM1, PKM2, PKMYT1, PLK2, PPARD, PPARG, PPIH, PPP1CA, PPP2R5A, PRDX2, PRDX4, PRKAR1A, PRKCBP1, PRNP, PRSS15, PSMA1, PTCH, PTEN, PTGS1, PTMA, PTN, PTPRN, RAB5A, RAC1, RAD50, RAF1, RALBP1, RAP1A, RARA, RARB, RASGRF1, RB1, RBBP4, RBL2, REA, REL, RELA, RELB, RET, RFC2, RGS19, RHOA, RHOB, RHOC, RHOD, RIPK1, RPN2, RPS6KB1, RRM1, SARS, SELENBP1, SEMA3C, SEMA4D, SEPP1, SERPINH1, SFN, SFPQ, SFRS7, SHB, SHH, SIAH2, SIVA, SIVA TP53, SKI, SKIL, SLC16A1, SLC1A4, SLC20A1, SMO, SMPD1, SNAI2, SND1, SNRPB2, SOCS1, SOCS3, SOD1, SORT1, SPINT2, SPRY2, SRC, SRPX, STAT1, STAT2, STAT3, STAT5B, STC1, TAF1, TBL3, TBRG4, TCF1, TCF7L2, TFAP2C, TFDP1, TFDP2, TGFA, TGFB1, TGFBI, TGFBR2, TGFBR3, THBS1, TIE, TIMP1, TIMP3, TJP1, TK1, TLE1, TNF, TNFRSF10A, TNFRSF10B, TNFRSF1A, TNFRSF1B, TNFRSF6, TNFSF7, TNK1, TOB1, TP53, TP53BP2, TP53I3, TP73, TPBG, TPT1, TRADD, TRAM1, TRRAP, TSG101, TUFM, TXNRD1, TYRO3, UBC, UBE2L6, UCHL1, USP7, VDAC1, VEGF, VHL, VIL2, WEE1, WNT1, WNT2, WNT2B, WNT3, WNT5A, WT1, XRCC1, YES1, YWHAB, YWHAZ, ZAP70, and ZNF9.

A rAAV vector may comprise as a transgene, a nucleic acid encoding a protein or functional RNA that modulates apoptosis. The following is a non-limiting list of genes associated with apoptosis and nucleic acids encoding the products of these genes and their homologues and encoding small interfering nucleic acids (e.g., shRNAs, miRNAs) that inhibit the expression of these genes and their homologues are useful as transgenes in certain embodiments of the disclosure: RPS27A, ABL1, AKT1, APAF1, BAD, BAG1, BAG3, BAG4, BAK1, BAX, BCL10, BCL2, BCL2A1, BCL2L1, BCL2L10, BCL2L11, BCL2L12, BCL2L13, BCL2L2, BCLAF1, BFAR, BID, BIK, NAIP, BIRC2, BIRC3, XIAP, BRCS, BIRC6, BIRC7, BIRC8, BNIP1, BNIP2, BNIP3, BNIP3L, BOK, BRAF, CARD10, CARD11, NLRC4, CARD14, NOD2, NOD1, CARD6, CARD8, CARD9, CASP1, CASP10, CASP14, CASP2, CASP3, CASP4, CASP5, CASP6, CASP7, CASP8, CASP9, CFLAR, CIDEA, CIDEB, CRADD, DAPK1, DAPK2, DFFA, DFFB, FADD, GADD45A, GDNF, HRK, IGF1R, LTA, LTBR, MCL1, NOL3, PYCARD, RIPK1, RIPK2, TNF, TNFRSF10A, TNFRSF10B, TNFRSF10C, TNFRSF10D, TNFRSF11B, TNFRSF12A, TNFRSF14, TNFRSF19, TNFRSF1A, TNFRSF1B, TNFRSF21, TNFRSF25, CD40, FAS, TNFRSF6B, CD27, TNFRSF9, TNFSF10, TNFSF14, TNFSF18, CD40LG, FASLG, CD70, TNFSF8, TNFSF9, TP53, TP53BP2, TP73, TP63, TRADD, TRAF1, TRAF2, TRAF3, TRAF4, and TRAF5.

In some aspects, the disclosure relates to methods and compositions for treating CNS-related disorders. As used herein, a “CNS-related disorder” is a disease or condition of the central nervous system. A CNS-related disorder may affect the spinal cord (e.g., a myelopathy), brain (e.g., a encephalopathy) or tissues surrounding the brain and spinal cord. A CNS-related disorder may be of a genetic origin, either inherited or acquired through a somatic mutation. A CNS-related disorder may be a psychological condition or disorder, e.g., Attention Deficient Hyperactivity Disorder, Autism Spectrum Disorder, Mood Disorder, Schizophrenia, Depression, Rett Syndrome, etc. A CNS-related disorder may be an autoimmune disorder. A CNS-related disorder may also be a cancer of the CNS, e.g., brain cancer. A CNS-related disorder that is a cancer may be a primary cancer of the CNS, e.g., an astrocytoma, glioblastomas, etc., or may be a cancer that has metastasized to CNS tissue, e.g., a lung cancer that has metastasized to the brain. Further non-limiting examples of CNS-related disorders, include Parkinson's Disease, Lysosomal Storage Disease, Ischemia, Neuropathic Pain, Amyotrophic lateral sclerosis (ALS), Multiple Sclerosis (MS), and Canavan disease (CD).

In some embodiments, the disclosure relates to a rAAV vector comprising a transgene, a nucleic acid encoding a protein or functional RNA useful for the treatment of a condition, disease or disorder associated with the central nervous system (CNS). The following is a non-limiting list of genes associated with CNS disease: DRD2, GRIA1, GRIA2, GRIN1, SLC1A1, SYP, SYT1, CHRNA7, 3Rtau/4rTUS, APP, BAX, BCL-2, GRIK1, GFAP, IL-1, AGER, associated with Alzheimer's Disease; UCH-L1, SKP1, EGLN1, Nurr-1, BDNF, TrkB, gstml, S106B, associated with Parkinson's Disease; IT15, PRNP, JPH3, TBP, ATXN1, ATXN2, ATXN3, Atrophin 1, FTL, TITF-1, associated with Huntington's Disease; FXN, associated with Freidrich's ataxia; ASPA, associated with Canavan's Disease; DMD, associated with muscular dystrophy; and SMN1, UBE1, DYNC1H1 associated with spinal muscular atrophy.

The skilled artisan will also realize that in the case of transgenes encoding proteins or polypeptides, that mutations that results in conservative amino acid substitutions may be made in a transgene to provide functionally equivalent variants, or homologs of a protein or polypeptide. In some aspects the disclosure embraces sequence alterations that result in conservative amino acid substitution of a transgene. In some embodiments, the transgene comprises a gene having a dominant negative mutation. For example, a transgene may express a mutant protein that interacts with the same elements as a wild-type protein, and thereby blocks some aspect of the function of the wild-type protein.

Useful transgene products also include miRNAs. miRNAs and other small interfering nucleic acids regulate gene expression via target RNA transcript cleavage/degradation or translational repression of the target messenger RNA (mRNA). miRNAs are natively expressed, typically as final 19-25 non-translated RNA products. miRNAs exhibit their activity through sequence-specific interactions with the 3′ untranslated regions (UTR) of target mRNAs. These endogenously expressed miRNAs form hairpin precursors which are subsequently processed into a miRNA duplex, and further into a “mature” single stranded miRNA molecule. This mature miRNA guides a multiprotein complex, miRISC, which identifies target site, e.g., in the 3′ UTR regions, of target mRNAs based upon their complementarity to the mature miRNA.

The following non-limiting list of miRNA genes, and their homologues, are useful as transgenes or as targets for small interfering nucleic acids encoded by transgenes (e.g., miRNA sponges, antisense oligonucleotides, TuD RNAs) in certain embodiments of the methods: hsa-let-7a, hsa-let-7a*, hsa-let-7b, hsa-let-7b*, hsa-let-7c, hsa-let-7c*, hsa-let-7d, hsa-let-7d*, hsa-let-7e, hsa-let-7e*, hsa-let-7f, hsa-let-7f-1*, hsa-let-7f-2*, hsa-let-7g, hsa-let-7g*, hsa-let-7i, hsa-let-7i*, hsa-miR-1, hsa-miR-100, hsa-miR-100*, hsa-miR-101, hsa-miR-101*, hsa-miR-103, hsa-miR-105, hsa-miR-105*, hsa-miR-106a, hsa-miR-106a*, hsa-miR-106b, hsa-miR-106b*, hsa-miR-107, hsa-miR-10a, hsa-miR-10a*, hsa-miR-10b, hsa-miR-10b*, hsa-miR-1178, hsa-miR-1179, hsa-miR-1180, hsa-miR-1181, hsa-miR-1182, hsa-miR-1183, hsa-miR-1184, hsa-miR-1185, hsa-miR-1197, hsa-miR-1200, hsa-miR-1201, hsa-miR-1202, hsa-miR-1203, hsa-miR-1204, hsa-miR-1205, hsa-miR-1206, hsa-miR-1207-3p, hsa-miR-1207-5p, hsa-miR-1208, hsa-miR-122, hsa-miR-122*, hsa-miR-1224-3p, hsa-miR-1224-5p, hsa-miR-1225-3p, hsa-miR-1225-5p, hsa-miR-1226, hsa-miR-1226*, hsa-miR-1227, hsa-miR-1228, hsa-miR-1228*, hsa-miR-1229, hsa-miR-1231, hsa-miR-1233, hsa-miR-1234, hsa-miR-1236, hsa-miR-1237, hsa-miR-1238, hsa-miR-124, hsa-miR-124*, hsa-miR-1243, hsa-miR-1244, hsa-miR-1245, hsa-miR-1246, hsa-miR-1247, hsa-miR-1248, hsa-miR-1249, hsa-miR-1250, hsa-miR-1251, hsa-miR-1252, hsa-miR-1253, hsa-miR-1254, hsa-miR-1255a, hsa-miR-1255b, hsa-miR-1256, hsa-miR-1257, hsa-miR-1258, hsa-miR-1259, hsa-miR-125a-3p, hsa-miR-125a-5p, hsa-miR-125b, hsa-miR-125b-1*, hsa-miR-125b-2*, hsa-miR-126, hsa-miR-126*, hsa-miR-1260, hsa-miR-1261, hsa-miR-1262, hsa-miR-1263, hsa-miR-1264, hsa-miR-1265, hsa-miR-1266, hsa-miR-1267, hsa-miR-1268, hsa-miR-1269, hsa-miR-1270, hsa-miR-1271, hsa-miR-1272, hsa-miR-1273, hsa-miR-127-3p, hsa-miR-1274a, hsa-miR-1274b, hsa-miR-1275, hsa-miR-12′7-5p, hsa-miR-1276, hsa-miR-1277, hsa-miR-1278, hsa-miR-1279, hsa-miR-128, hsa-miR-1280, hsa-miR-1281, hsa-miR-1282, hsa-miR-1283, hsa-miR-1284, hsa-miR-1285, hsa-miR-1286, hsa-miR-1287, hsa-miR-1288, hsa-miR-1289, hsa-miR-129*, hsa-miR-1290, hsa-miR-1291, hsa-miR-1292, hsa-miR-1293, hsa-miR-129-3p, hsa-miR-1294, hsa-miR-1295, hsa-miR-129-5p, hsa-miR-1296, hsa-miR-1297, hsa-miR-1298, hsa-miR-1299, hsa-miR-1300, hsa-miR-1301, hsa-miR-1302, hsa-miR-1303, hsa-miR-1304, hsa-miR-1305, hsa-miR-1306, hsa-miR-1307, hsa-miR-1308, hsa-miR-130a, hsa-miR-130a*, hsa-miR-130b, hsa-miR-130b*, hsa-miR-132, hsa-miR-132*, hsa-miR-1321, hsa-miR-1322, hsa-miR-1323, hsa-miR-1324, hsa-miR-133a, hsa-miR-133b, hsa-miR-134, hsa-miR-135a, hsa-miR-135a*, hsa-miR-135b, hsa-miR-135b*, hsa-miR-136, hsa-miR-136*, hsa-miR-137, hsa-miR-138, hsa-miR-138-1*, hsa-miR-138-2*, hsa-miR-139-3p, hsa-miR-139-5p, hsa-miR-140-3p, hsa-miR-140-5p, hsa-miR-141, hsa-miR-141*, hsa-miR-142-3p, hsa-miR-142-5p, hsa-miR-143, hsa-miR-143*, hsa-miR-144, hsa-miR-144*, hsa-miR-145, hsa-miR-145*, hsa-miR-146a, hsa-miR-146a*, hsa-miR-146b-3p, hsa-miR-146b-5p, hsa-miR-147, hsa-miR-147b, hsa-miR-148a, hsa-miR-148a*, hsa-miR-148b, hsa-miR-148b*, hsa-miR-149, hsa-miR-149*, hsa-miR-150, hsa-miR-150*, hsa-miR-151-3p, hsa-miR-151-5p, hsa-miR-152, hsa-miR-153, hsa-miR-154, hsa-miR-154*, hsa-miR-155, hsa-miR-155*, hsa-miR-15a, hsa-miR-15a*, hsa-miR-15b, hsa-miR-15b*, hsa-miR-16, hsa-miR-16-1*, hsa-miR-16-2*, hsa-miR-17, hsa-miR-17*, hsa-miR-181a, hsa-miR-181a*, hsa-miR-181a-2*, hsa-miR-181b, hsa-miR-181c, hsa-miR-181c*, hsa-miR-181d, hsa-miR-182, hsa-miR-182*, hsa-miR-1825, hsa-miR-1826, hsa-miR-1827, hsa-miR-183, hsa-miR-183*, hsa-miR-184, hsa-miR-185, hsa-miR-185*, hsa-miR-186, hsa-miR-186*, hsa-miR-187, hsa-miR-187*, hsa-miR-188-3p, hsa-miR-188-5p, hsa-miR-18a, hsa-miR-18a*, hsa-miR-18b, hsa-miR-18b*, hsa-miR-190, hsa-miR-190b, hsa-miR-191, hsa-miR-191*, hsa-miR-192, hsa-miR-192*, hsa-miR-193a-3p, hsa-miR-193a-5p, hsa-miR-193b, hsa-miR-193b*, hsa-miR-194, hsa-miR-194*, hsa-miR-195, hsa-miR-195*, hsa-miR-196a, hsa-miR-196a*, hsa-miR-196b, hsa-miR-197, hsa-miR-198, hsa-miR-199a-3p, hsa-miR-199a-5p, hsa-miR-199b-5p, hsa-miR-19a, hsa-miR-19a*, hsa-miR-19b, hsa-miR-19b-1*, hsa-miR-19b-2*, hsa-miR-200a, hsa-miR-200a*, hsa-miR-200b, hsa-miR-200b*, hsa-miR-200c, hsa-miR-200c*, hsa-miR-202, hsa-miR-202*, hsa-miR-203, hsa-miR-204, hsa-miR-205, hsa-miR-206, hsa-miR-208a, hsa-miR-208b, hsa-miR-20a, hsa-miR-20a*, hsa-miR-20b, hsa-miR-20b*, hsa-miR-21, hsa-miR-21*, hsa-miR-210, hsa-miR-211, hsa-miR-212, hsa-miR-214, hsa-miR-214*, hsa-miR-215, hsa-miR-216a, hsa-miR-216b, hsa-miR-217, hsa-miR-218, hsa-miR-218-1*, hsa-miR-218-2*, hsa-miR-219-1-3p, hsa-miR-219-2-3p, hsa-miR-219-5p, hsa-miR-22, hsa-miR-22*, hsa-miR-220a, hsa-miR-220b, hsa-miR-220c, hsa-miR-221, hsa-miR-221*, hsa-miR-222, hsa-miR-222*, hsa-miR-223, hsa-miR-223*, hsa-miR-224, hsa-miR-23a, hsa-miR-23a*, hsa-miR-23b, hsa-miR-23b*, hsa-miR-24, hsa-miR-24-1*, hsa-miR-24-2*, hsa-miR-25, hsa-miR-25*, hsa-miR-26a, hsa-miR-26a-1*, hsa-miR-26a-2*, hsa-miR-26b, hsa-miR-26b*, hsa-miR-27a, hsa-miR-27a*, hsa-miR-27b, hsa-miR-27b*, hsa-miR-28-3p, hsa-miR-28-5p, hsa-miR-296-3p, hsa-miR-296-5p, hsa-miR-297, hsa-miR-298, hsa-miR-299-3p, hsa-miR-299-5p, hsa-miR-29a, hsa-miR-29a*, hsa-miR-29b, hsa-miR-29b-1*, hsa-miR-29b-2*, hsa-miR-29c, hsa-miR-29c*, hsa-miR-300, hsa-miR-301a, hsa-miR-301b, hsa-miR-302a, hsa-miR-302a*, hsa-miR-302b, hsa-miR-302b*, hsa-miR-302c, hsa-miR-302c*, hsa-miR-302d, hsa-miR-302d*, hsa-miR-302e, hsa-miR-302f, hsa-miR-30a, hsa-miR-30a*, hsa-miR-30b, hsa-miR-30b*, hsa-miR-30c, hsa-miR-30c-1*, hsa-miR-30c-2*, hsa-miR-30d, hsa-miR-30d*, hsa-miR-30e, hsa-miR-30e*, hsa-miR-31, hsa-miR-31*, hsa-miR-32, hsa-miR-32*, hsa-miR-320a, hsa-miR-320b, hsa-miR-320c, hsa-miR-320d, hsa-miR-323-3p, hsa-miR-323-5p, hsa-miR-324-3p, hsa-miR-324-5p, hsa-miR-325, hsa-miR-326, hsa-miR-328, hsa-miR-329, hsa-miR-330-3p, hsa-miR-330-5p, hsa-miR-331-3p, hsa-miR-331-5p, hsa-miR-335, hsa-miR-335*, hsa-miR-337-3p, hsa-miR-337-5p, hsa-miR-338-3p, hsa-miR-338-5p, hsa-miR-339-3p, hsa-miR-339-5p, hsa-miR-33a, hsa-miR-33a*, hsa-miR-33b, hsa-miR-33b*, hsa-miR-340, hsa-miR-340*, hsa-miR-342-3p, hsa-miR-342-5p, hsa-miR-345, hsa-miR-346, hsa-miR-34a, hsa-miR-34a*, hsa-miR-34b, hsa-miR-34b*, hsa-miR-34c-3p, hsa-miR-34c-5p, hsa-miR-361-3p, hsa-miR-361-5p, hsa-miR-362-3p, hsa-miR-362-5p, hsa-miR-363, hsa-miR-363*, hsa-miR-365, hsa-miR-367, hsa-miR-367*, hsa-miR-369-3p, hsa-miR-369-5p, hsa-miR-370, hsa-miR-371-3p, hsa-miR-371-5p, hsa-miR-372, hsa-miR-373, hsa-miR-373*, hsa-miR-374a, hsa-miR-374a*, hsa-miR-374b, hsa-miR-374b*, hsa-miR-375, hsa-miR-376a, hsa-miR-376a*, hsa-miR-376b, hsa-miR-376c, hsa-miR-377, hsa-miR-377*, hsa-miR-378, hsa-miR-378*, hsa-miR-379, hsa-miR-379*, hsa-miR-380, hsa-miR-380*, hsa-miR-381, hsa-miR-382, hsa-miR-383, hsa-miR-384, hsa-miR-409-3p, hsa-miR-409-5p, hsa-miR-410, hsa-miR-411, hsa-miR-411*, hsa-miR-412, hsa-miR-421, hsa-miR-422a, hsa-miR-423-3p, hsa-miR-423-5p, hsa-miR-424, hsa-miR-424*, hsa-miR-425, hsa-miR-425*, hsa-miR-429, hsa-miR-431, hsa-miR-431*, hsa-miR-432, hsa-miR-432*, hsa-miR-433, hsa-miR-448, hsa-miR-449a, hsa-miR-449b, hsa-miR-450a, hsa-miR-450b-3p, hsa-miR-450b-5p, hsa-miR-451, hsa-miR-452, hsa-miR-452*, hsa-miR-453, hsa-miR-454, hsa-miR-454*, hsa-miR-455-3p, hsa-miR-455-5p, hsa-miR-483-3p, hsa-miR-483-5p, hsa-miR-484, hsa-miR-485-3p, hsa-miR-485-5p, hsa-miR-486-3p, hsa-miR-486-5p, hsa-miR-487a, hsa-miR-487b, hsa-miR-488, hsa-miR-488*, hsa-miR-489, hsa-miR-490-3p, hsa-miR-490-5p, hsa-miR-491-3p, hsa-miR-491-5p, hsa-miR-492, hsa-miR-493, hsa-miR-493*, hsa-miR-494, hsa-miR-495, hsa-miR-496, hsa-miR-497, hsa-miR-497*, hsa-miR-498, hsa-miR-499-3p, hsa-miR-499-5p, hsa-miR-500, hsa-miR-500*, hsa-miR-501-3p, hsa-miR-501-5p, hsa-miR-502-3p, hsa-miR-502-5p, hsa-miR-503, hsa-miR-504, hsa-miR-505, hsa-miR-505*, hsa-miR-506, hsa-miR-507, hsa-miR-508-3p, hsa-miR-508-5p, hsa-miR-509-3-5p, hsa-miR-509-3p, hsa-miR-509-5p, hsa-miR-510, hsa-miR-511, hsa-miR-512-3p, hsa-miR-512-5p, hsa-miR-513a-3p, hsa-miR-513a-5p, hsa-miR-513b, hsa-miR-513c, hsa-miR-514, hsa-miR-515-3p, hsa-miR-515-5p, hsa-miR-516a-3p, hsa-miR-516a-5p, hsa-miR-516b, hsa-miR-517*, hsa-miR-517a, hsa-miR-517b, hsa-miR-517c, hsa-miR-518a-3p, hsa-miR-518a-5p, hsa-miR-518b, hsa-miR-518c, hsa-miR-518c*, hsa-miR-518d-3p, hsa-miR-518d-5p, hsa-miR-518e, hsa-miR-518e*, hsa-miR-518f, hsa-miR-518f*, hsa-miR-519a, hsa-miR-519b-3p, hsa-miR-519c-3p, hsa-miR-519d, hsa-miR-519e, hsa-miR-519e*, hsa-miR-520a-3p, hsa-miR-520a-5p, hsa-miR-520b, hsa-miR-520c-3p, hsa-miR-520d-3p, hsa-miR-520d-5p, hsa-miR-520e, hsa-miR-520f, hsa-miR-520g, hsa-miR-520h, hsa-miR-521, hsa-miR-522, hsa-miR-523, hsa-miR-524-3p, hsa-miR-524-5p, hsa-miR-525-3p, hsa-miR-525-5p, hsa-miR-526b, hsa-miR-526b*, hsa-miR-532-3p, hsa-miR-532-5p, hsa-miR-539, hsa-miR-541, hsa-miR-541*, hsa-miR-542-3p, hsa-miR-542-5p, hsa-miR-543, hsa-miR-544, hsa-miR-545, hsa-miR-545*, hsa-miR-548a-3p, hsa-miR-548a-5p, hsa-miR-548b-3p, hsa-miR-548b-5p, hsa-miR-548c-3p, hsa-miR-548c-5p, hsa-miR-548d-3p, hsa-miR-548d-5p, hsa-miR-548e, hsa-miR-548f, hsa-miR-548g, hsa-miR-548h, hsa-miR-548i, hsa-miR-548j, hsa-miR-548k, hsa-miR-548l, hsa-miR-548m, hsa-miR-548n, hsa-miR-548o, hsa-miR-548p, hsa-miR-549, hsa-miR-550, hsa-miR-550*, hsa-miR-551a, hsa-miR-551b, hsa-miR-551b*, hsa-miR-552, hsa-miR-553, hsa-miR-554, hsa-miR-555, hsa-miR-556-3p, hsa-miR-556-5p, hsa-miR-557, hsa-miR-558, hsa-miR-559, hsa-miR-561, hsa-miR-562, hsa-miR-563, hsa-miR-564, hsa-miR-566, hsa-miR-567, hsa-miR-568, hsa-miR-569, hsa-miR-570, hsa-miR-571, hsa-miR-572, hsa-miR-573, hsa-miR-574-3p, hsa-miR-574-5p, hsa-miR-575, hsa-miR-576-3p, hsa-miR-576-5p, hsa-miR-577, hsa-miR-578, hsa-miR-579, hsa-miR-580, hsa-miR-581, hsa-miR-582-3p, hsa-miR-582-5p, hsa-miR-583, hsa-miR-584, hsa-miR-585, hsa-miR-586, hsa-miR-587, hsa-miR-588, hsa-miR-589, hsa-miR-589*, hsa-miR-590-3p, hsa-miR-590-5p, hsa-miR-591, hsa-miR-592, hsa-miR-593, hsa-miR-593*, hsa-miR-595, hsa-miR-596, hsa-miR-597, hsa-miR-598, hsa-miR-599, hsa-miR-600, hsa-miR-601, hsa-miR-602, hsa-miR-603, hsa-miR-604, hsa-miR-605, hsa-miR-606, hsa-miR-607, hsa-miR-608, hsa-miR-609, hsa-miR-610, hsa-miR-611, hsa-miR-612, hsa-miR-613, hsa-miR-614, hsa-miR-615-3p, hsa-miR-615-5p, hsa-miR-616, hsa-miR-616*, hsa-miR-617, hsa-miR-618, hsa-miR-619, hsa-miR-620, hsa-miR-621, hsa-miR-622, hsa-miR-623, hsa-miR-624, hsa-miR-624*, hsa-miR-625, hsa-miR-625*, hsa-miR-626, hsa-miR-627, hsa-miR-628-3p, hsa-miR-628-5p, hsa-miR-629, hsa-miR-629*, hsa-miR-630, hsa-miR-631, hsa-miR-632, hsa-miR-633, hsa-miR-634, hsa-miR-635, hsa-miR-636, hsa-miR-637, hsa-miR-638, hsa-miR-639, hsa-miR-640, hsa-miR-641, hsa-miR-642, hsa-miR-643, hsa-miR-644, hsa-miR-645, hsa-miR-646, hsa-miR-647, hsa-miR-648, hsa-miR-649, hsa-miR-650, hsa-miR-651, hsa-miR-652, hsa-miR-653, hsa-miR-654-3p, hsa-miR-654-5p, hsa-miR-655, hsa-miR-656, hsa-miR-657, hsa-miR-658, hsa-miR-659, hsa-miR-660, hsa-miR-661, hsa-miR-662, hsa-miR-663, hsa-miR-663b, hsa-miR-664, hsa-miR-664*, hsa-miR-665, hsa-miR-668, hsa-miR-671-3p, hsa-miR-671-5p, hsa-miR-675, hsa-miR-7, hsa-miR-708, hsa-miR-708*, hsa-miR-7-1*, hsa-miR-7-2*, hsa-miR-720, hsa-miR-744, hsa-miR-744*, hsa-miR-758, hsa-miR-760, hsa-miR-765, hsa-miR-766, hsa-miR-767-3p, hsa-miR-767-5p, hsa-miR-768-3p, hsa-miR-768-5p, hsa-miR-769-3p, hsa-miR-769-5p, hsa-miR-770-5p, hsa-miR-802, hsa-miR-873, hsa-miR-874, hsa-miR-875-3p, hsa-miR-875-5p, hsa-miR-876-3p, hsa-miR-876-5p, hsa-miR-877, hsa-miR-877*, hsa-miR-885-3p, hsa-miR-885-5p, hsa-miR-886-3p, hsa-miR-886-5p, hsa-miR-887, hsa-miR-888, hsa-miR-888*, hsa-miR-889, hsa-miR-890, hsa-miR-891a, hsa-miR-891b, hsa-miR-892a, hsa-miR-892b, hsa-miR-9, hsa-miR-9*, hsa-miR-920, hsa-miR-921, hsa-miR-922, hsa-miR-923, hsa-miR-924, hsa-miR-92a, hsa-miR-92a-1*, hsa-miR-92a-2*, hsa-miR-92b, hsa-miR-92b*, hsa-miR-93, hsa-miR-93*, hsa-miR-933, hsa-miR-934, hsa-miR-935, hsa-miR-936, hsa-miR-937, hsa-miR-938, hsa-miR-939, hsa-miR-940, hsa-miR-941, hsa-miR-942, hsa-miR-943, hsa-miR-944, hsa-miR-95, hsa-miR-96, hsa-miR-96*, hsa-miR-98, hsa-miR-99a, hsa-miR-99a*, hsa-miR-99b, and hsa-miR-99b*.

A miRNA inhibits the function of the mRNAs it targets and, as a result, inhibits expression of the polypeptides encoded by the mRNAs. Thus, blocking (partially or totally) the activity of the miRNA (e.g., silencing the miRNA) can effectively induce, or restore, expression of a polypeptide whose expression is inhibited (derepress the polypeptide). In one embodiment, derepression of polypeptides encoded by mRNA targets of a miRNA is accomplished by inhibiting the miRNA activity in cells through any one of a variety of methods. For example, blocking the activity of a miRNA can be accomplished by hybridization with a small interfering nucleic acid (e.g., antisense oligonucleotide, miRNA sponge, TuD RNA) that is complementary, or substantially complementary to, the miRNA, thereby blocking interaction of the miRNA with its target mRNA. As used herein, an small interfering nucleic acid that is substantially complementary to a miRNA is one that is capable of hybridizing with a miRNA, and blocking the miRNA's activity. In some embodiments, an small interfering nucleic acid that is substantially complementary to a miRNA is an small interfering nucleic acid that is complementary with the miRNA at all but 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, or 18 bases. In some embodiments, an small interfering nucleic acid sequence that is substantially complementary to a miRNA, is an small interfering nucleic acid sequence that is complementary with the miRNA at, at least, one base.

A “miRNA Inhibitor” is an agent that blocks miRNA function, expression and/or processing. For instance, these molecules include but are not limited to microRNA specific antisense, microRNA sponges, tough decoy RNAs (TuD RNAs) and microRNA oligonucleotides (double-stranded, hairpin, short oligonucleotides) that inhibit miRNA interaction with a Drosha complex. MicroRNA inhibitors can be expressed in cells from a transgenes of a rAAV vector, as discussed above. MicroRNA sponges specifically inhibit miRNAs through a complementary heptameric seed sequence (Ebert, M. S. Nature Methods, Epub Aug. 12, 2007;). In some embodiments, an entire family of miRNAs can be silenced using a single sponge sequence. TuD RNAs achieve efficient and long-term-suppression of specific miRNAs in mammalian cells (See, e.g., Takeshi Haraguchi, et al., Nucleic Acids Research, 2009, Vol. 37, No. 6 e43, the contents of which relating to TuD RNAs are incorporated herein by reference). Other methods for silencing miRNA function (derepression of miRNA targets) in cells will be apparent to one of ordinary skill in the art.

In some embodiments, the cloning capacity of the recombinant RNA vector may limited and a desired coding sequence may require the complete replacement of the virus's 4.8 kilobase genome. Large genes may, therefore, not be suitable for use in a standard recombinant AAV vector, in some cases. The skilled artisan will appreciate that options are available in the art for overcoming a limited coding capacity. For example, the AAV ITRs of two genomes can anneal to form head to tail concatamers, almost doubling the capacity of the vector. Insertion of splice sites allows for the removal of the ITRs from the transcript. Other options for overcoming a limited cloning capacity will be apparent to the skilled artisan.

Somatic Transgenic Animal Models Produced Using rAAV-Based Gene Transfer

The disclosure also involves the production of somatic transgenic animal models of disease using recombinant Adeno-Associated Virus (rAAV) based methods. The methods are based, at least in part, on the observation that AAV serotypes and variants thereof mediate efficient and stable gene transfer in a tissue specific manner in adult animals. The rAAV elements (capsid, promoter, transgene products) are combined to achieve somatic transgenic animal models that express a stable transgene in a time and tissue specific manner. The somatic transgenic animal produced by the methods of the disclosure can serve as useful models of human disease, pathological state, and/or to characterize the effects of gene for which the function (e.g., tissue specific, disease role) is unknown or not fully understood. For example, an animal (e.g., mouse) can be infected at a distinct developmental stage (e.g., age) with a rAAV comprising a capsid having a specific tissue targeting capability (e.g., liver, heart, pancreas) and a transgene having a tissue specific promoter driving expression of a gene involved in disease. Upon infection, the rAAV infects distinct cells of the target tissue and produces the product of the transgene.

In some embodiments, the sequence of the coding region of a transgene is modified. The modification may alter the function of the product encoded by the transgene. The effect of the modification can then be studied in vivo by generating a somatic transgenic animal model using the methods disclosed herein. In some embodiments, modification of the sequence of coding region is a nonsense mutation that results in a fragment (e.g., a truncated version). In other cases, the modification is a missense mutation that results in an amino acid substitution. Other modifications are possible and will be apparent to the skilled artisan.

In some embodiments, the transgene causes a pathological state. A transgene that causes a pathological state is a gene whose product has a role in a disease or disorder (e.g., causes the disease or disorder, makes the animal susceptible to the disease or disorder) and/or may induce the disease or disorder in the animal. The animal can then be observed to evaluate any number of aspects of the disease (e.g., progression, response to treatment, etc.). These examples are not meant to be limiting, other aspects and examples are disclosed herein and described in more detail below.

The disclosure in some aspects, provide methods for producing somatic transgenic animal models through the targeted destruction of specific cell types. For example, models of type 1 diabetes can be produced by the targeted destruction of pancreatic Beta-islets. In other examples, the targeted destruction of specific cell types can be used to evaluate the role of specific cell types on human disease. In this regard, transgenes that encode cellular toxins (e.g., diphtheria toxin A (DTA)) or pro-apoptotic genes (NTR, Box, etc.) can be useful as transgenes for functional ablation of specific cell types. Other exemplary transgenes, whose products kill cells are embraced by the methods disclosed herein and will be apparent to one of ordinary skill in the art.

The disclosure in some aspects, provides methods for producing somatic transgenic animal models to study the long-term effects of over-expression or knockdown of genes. The long term over expression or knockdown (e.g., by shRNA, miRNA, miRNA inhibitor, etc.) of genes in specific target tissues can disturb normal metabolic balance and establish a pathological state, thereby producing an animal model of a disease, such as, for example, cancer. The disclosure in some aspects, provides methods for producing somatic transgenic animal models to study the long-term effects of over-expression or knockdown of gene of potential oncogenes and other genes to study tumorigenesis and gene function in the targeted tissues. Useful transgene products include proteins that are known to be associated with cancer and small interfering nucleic acids inhibiting the expression of such proteins.

Other suitable transgenes may be readily selected by one of skill in the art provided that they are useful for creating animal models of tissue-specific pathological state and/or disease.

Recombinant AAV Administration Methods

The rAAVs may be delivered to a subject in compositions according to any appropriate methods known in the art. The rAAV, preferably suspended in a physiologically compatible carrier (e.g., in a composition), may be administered to a subject, e.g., host animal, such as a human, mouse, rat, cat, dog, sheep, rabbit, horse, cow, goat, pig, guinea pig, hamster, chicken, turkey, or a non-human primate (e.g., Macaque). In some embodiments a host animal does not include a human.

Delivery of the rAAVs to a mammalian subject may be by, for example, intramuscular injection or by administration into the bloodstream of the mammalian subject. Administration into the bloodstream may be by injection into a vein, an artery, or any other vascular conduit. In some embodiments, the rAAVs are administered into the bloodstream by way of isolated limb perfusion, a technique well known in the surgical arts, the method essentially enabling the artisan to isolate a limb from the systemic circulation prior to administration of the rAAV virions. A variant of the isolated limb perfusion technique, described in U.S. Pat. No. 6,177,403, can also be employed by the skilled artisan to administer the virions into the vasculature of an isolated limb to potentially enhance transduction into muscle cells or tissue. Moreover, in certain instances, it may be desirable to deliver the virions to the CNS of a subject. By “CNS” is meant all cells and tissue of the brain and spinal cord of a vertebrate. Thus, the term includes, but is not limited to, neuronal cells, glial cells, astrocytes, cerebrospinal fluid (CSF), interstitial spaces, bone, cartilage and the like. Recombinant AAVs may be delivered directly to the CNS or brain by injection into, e.g., the ventricular region, as well as to the striatum (e.g., the caudate nucleus or putamen of the striatum), spinal cord and neuromuscular junction, or cerebellar lobule, with a needle, catheter or related device, using neurosurgical techniques known in the art, such as by stereotactic injection (see, e.g., Stein et al., J Virol 73:3424-3429, 1999; Davidson et al., PNAS 97:3428-3432, 2000; Davidson et al., Nat. Genet. 3:219-223, 1993; and Alisky and Davidson, Hum. Gene Ther. 11:2315-2329, 2000).

The compositions of the disclosure may comprise an rAAV alone, or in combination with one or more other viruses (e.g., a second rAAV encoding having one or more different transgenes). In some embodiments, a composition comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more different rAAVs each having one or more different transgenes.

Suitable carriers may be readily selected by one of skill in the art in view of the indication for which the rAAV is directed. For example, one suitable carrier includes saline, which may be formulated with a variety of buffering solutions (e.g., phosphate buffered saline). Other exemplary carriers include sterile saline, lactose, sucrose, calcium phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil, and water. The selection of the carrier is not a limitation of the disclosure.

Optionally, the compositions of the disclosure may contain, in addition to the rAAV and carrier(s), other conventional pharmaceutical ingredients, such as preservatives, or chemical stabilizers. Suitable exemplary preservatives include chlorobutanol, potassium sorbate, sorbic acid, sulfur dioxide, propyl gallate, the parabens, ethyl vanillin, glycerin, phenol, and parachlorophenol. Suitable chemical stabilizers include gelatin and albumin.

RAAVs are administered in sufficient amounts to transfect the cells of a desired tissue and to provide sufficient levels of gene transfer and expression without undue adverse effects. Conventional and pharmaceutically acceptable routes of administration include, but are not limited to, direct delivery to the selected organ (e.g., intraportal delivery to the liver), oral, inhalation (including intranasal and intratracheal delivery), intraocular, intravenous, intramuscular, subcutaneous, intradermal, intratumoral, and other parental routes of administration. Routes of administration may be combined, if desired.

The dose of rAAV virions required to achieve a particular “therapeutic effect,” e.g., the units of dose in genome copies/per kilogram of body weight (GC/kg), will vary based on several factors including, but not limited to: the route of rAAV virion administration, the level of gene or RNA expression required to achieve a therapeutic effect, the specific disease or disorder being treated, and the stability of the gene or RNA product. One of skill in the art can readily determine a rAAV virion dose range to treat a patient having a particular disease or disorder based on the aforementioned factors, as well as other factors that are well known in the art.

An effective amount of an rAAV is an amount sufficient to target infect an animal, target a desired tissue. In some embodiments, an effective amount of an rAAV is an amount sufficient to produce a stable somatic transgenic animal model. The effective amount will depend primarily on factors such as the species, age, weight, health of the subject, and the tissue to be targeted, and may thus vary among animal and tissue. For example, an effective amount of the rAAV is generally in the range of from about 1 ml to about 100 ml of solution containing from about 10⁹ to 10¹⁶ genome copies. In some cases, a dosage between about 10¹¹ to 10¹² rAAV genome copies is appropriate. In certain embodiments, 10¹² rAAV genome copies is effective to target heart, liver, and pancreas tissues. In some cases, stable transgenic animals are produced by multiple doses of an rAAV.

In some embodiments, rAAV compositions are formulated to reduce aggregation of AAV particles in the composition, particularly where high rAAV concentrations are present (e.g., ˜10¹³ GC/ml or more). Methods for reducing aggregation of rAAVs are well known in the art and, include, for example, addition of surfactants, pH adjustment, salt concentration adjustment, etc. (See, e.g., Wright F R, et al., Molecular Therapy (2005) 12, 171-178, the contents of which are incorporated herein by reference.)

Formulation of pharmaceutically-acceptable excipients and carrier solutions is well-known to those of skill in the art, as is the development of suitable dosing and treatment regimens for using the particular compositions described herein in a variety of treatment regimens.

Typically, these formulations may contain at least about 0.1% of the active compound or more, although the percentage of the active ingredient(s) may, of course, be varied and may conveniently be between about 1 or 2% and about 70% or 80% or more of the weight or volume of the total formulation. Naturally, the amount of active compound in each therapeutically-useful composition may be prepared is such a way that a suitable dosage will be obtained in any given unit dose of the compound. Factors such as solubility, bioavailability, biological half-life, route of administration, product shelf life, as well as other pharmacological considerations will be contemplated by one skilled in the art of preparing such pharmaceutical formulations, and as such, a variety of dosages and treatment regimens may be desirable.

In certain circumstances it will be desirable to deliver the rAAV-based therapeutic constructs in suitably formulated pharmaceutical compositions disclosed herein either subcutaneously, intrapancreatically, intranasally, parenterally, intravenously, intramuscularly, intrathecally, or orally, intraperitoneally, or by inhalation. In some embodiments, the administration modalities as described in U.S. Pat. Nos. 5,543,158; 5,641,515 and 5,399,363 (each specifically incorporated herein by reference in its entirety) may be used to deliver rAAVs. In some embodiments, a preferred mode of administration is by portal vein injection.

The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions. Dispersions may also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these preparations contain a preservative to prevent the growth of microorganisms. In many cases the form is sterile and fluid to the extent that easy syringability exists. It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms, such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable oils. Proper fluidity may be maintained, for example, by the use of a coating, such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. The prevention of the action of microorganisms can be brought about by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the use in the compositions of agents delaying absorption, for example, aluminum monostearate and gelatin.

For administration of an injectable aqueous solution, for example, the solution may be suitably buffered, if necessary, and the liquid diluent first rendered isotonic with sufficient saline or glucose. These particular aqueous solutions are especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this connection, a sterile aqueous medium that can be employed will be known to those of skill in the art. For example, one dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, “Remington's Pharmaceutical Sciences” 15th Edition, pages 1035-1038 and 1570-1580). Some variation in dosage will necessarily occur depending on the condition of the host. The person responsible for administration will, in any event, determine the appropriate dose for the individual host.

Sterile injectable solutions are prepared by incorporating the active rAAV in the required amount in the appropriate solvent with various of the other ingredients enumerated herein, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the various sterilized active ingredients into a sterile vehicle which contains the basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum-drying and freeze-drying techniques which yield a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof.

The rAAV compositions disclosed herein may also be formulated in a neutral or salt form. Pharmaceutically-acceptable salts, include the acid addition salts (formed with the free amino groups of the protein) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be administered in a manner compatible with the dosage formulation and in such amount as is therapeutically effective. The formulations are easily administered in a variety of dosage forms such as injectable solutions, drug-release capsules, and the like.

As used herein, “carrier” includes any and all solvents, dispersion media, vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use of such media and agents for pharmaceutical active substances is well known in the art. Supplementary active ingredients can also be incorporated into the compositions. The phrase “pharmaceutically-acceptable” refers to molecular entities and compositions that do not produce an allergic or similar untoward reaction when administered to a host.

Delivery vehicles such as liposomes, nanocapsules, microparticles, microspheres, lipid particles, vesicles, and the like, may be used for the introduction of the compositions of the disclosure into suitable host cells. In particular, the rAAV vector delivered transgenes may be formulated for delivery either encapsulated in a lipid particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like.

Such formulations may be preferred for the introduction of pharmaceutically acceptable formulations of the nucleic acids or the rAAV constructs disclosed herein. The formation and use of liposomes is generally known to those of skill in the art. Recently, liposomes were developed with improved serum stability and circulation half-times (U.S. Pat. No. 5,741,516). Further, various methods of liposome and liposome like preparations as potential drug carriers have been described (U.S. Pat. Nos. 5,567,434; 5,552,157; 5,565,213; 5,738,868 and 5,795,587).

Liposomes have been used successfully with a number of cell types that are normally resistant to transfection by other procedures. In addition, liposomes are free of the DNA length constraints that are typical of viral-based delivery systems. Liposomes have been used effectively to introduce genes, drugs, radiotherapeutic agents, viruses, transcription factors and allosteric effectors into a variety of cultured cell lines and animals. In addition, several successful clinical trials examining the effectiveness of liposome-mediated drug delivery have been completed.

Liposomes are formed from phospholipids that are dispersed in an aqueous medium and spontaneously form multilamellar concentric bilayer vesicles (also termed multilamellar vesicles (MLVs). MLVs generally have diameters of from 25 nm to 4 μm. Sonication of MLVs results in the formation of small unilamellar vesicles (SUVs) with diameters in the range of 200 to 500 .ANG., containing an aqueous solution in the core.

Alternatively, nanocapsule formulations of the rAAV may be used. Nanocapsules can generally entrap substances in a stable and reproducible way. To avoid side effects due to intracellular polymeric overloading, such ultrafine particles (sized around 0.1 μm) should be designed using polymers able to be degraded in vivo. Biodegradable polyalkyl-cyanoacrylate nanoparticles that meet these requirements are contemplated for use.

In addition to the methods of delivery described above, the following techniques are also contemplated as alternative methods of delivering the rAAV compositions to a host. Sonophoresis (e.g., ultrasound) has been used and described in U.S. Pat. No. 5,656,016 as a device for enhancing the rate and efficacy of drug permeation into and through the circulatory system. Other drug delivery alternatives contemplated are intraosseous injection (U.S. Pat. No. 5,779,708), microchip devices (U.S. Pat. No. 5,797,898), ophthalmic formulations (Bourlais et al., 1998), transdermal matrices (U.S. Pat. Nos. 5,770,219 and 5,783,208) and feedback-controlled delivery (U.S. Pat. No. 5,697,899).

Kits and Related Compositions

The agents described herein may, in some embodiments, be assembled into pharmaceutical or diagnostic or research kits to facilitate their use in therapeutic, diagnostic or research applications. A kit may include one or more containers housing the components of the disclosure and instructions for use. Specifically, such kits may include one or more agents described herein, along with instructions describing the intended application and the proper use of these agents. In certain embodiments agents in a kit may be in a pharmaceutical formulation and dosage suitable for a particular application and for a method of administration of the agents. Kits for research purposes may contain the components in appropriate concentrations or quantities for running various experiments.

The kit may be designed to facilitate use of the methods described herein by researchers and can take many forms. Each of the compositions of the kit, where applicable, may be provided in liquid form (e.g., in solution), or in solid form, (e.g., a dry powder). In certain cases, some of the compositions may be constitutable or otherwise processable (e.g., to an active form), for example, by the addition of a suitable solvent or other species (for example, water or a cell culture medium), which may or may not be provided with the kit. As used herein, “instructions” can define a component of instruction and/or promotion, and typically involve written instructions on or associated with packaging of the disclosure. Instructions also can include any oral or electronic instructions provided in any manner such that a user will clearly recognize that the instructions are to be associated with the kit, for example, audiovisual (e.g., videotape, DVD, etc.), Internet, and/or web-based communications, etc. The written instructions may be in a form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological products, which instructions can also reflects approval by the agency of manufacture, use or sale for animal administration.

The kit may contain any one or more of the components described herein in one or more containers. As an example, in one embodiment, the kit may include instructions for mixing one or more components of the kit and/or isolating and mixing a sample and applying to a subject. The kit may include a container housing agents described herein. The agents may be in the form of a liquid, gel or solid (powder). The agents may be prepared sterilely, packaged in syringe and shipped refrigerated. Alternatively it may be housed in a vial or other container for storage. A second container may have other agents prepared sterilely. Alternatively the kit may include the active agents premixed and shipped in a syringe, vial, tube, or other container. The kit may have one or more or all of the components required to administer the agents to an animal, such as a syringe, topical application devices, or iv needle tubing and bag, particularly in the case of the kits for producing specific somatic animal models.

In some cases, the methods involve transfecting cells with total cellular DNAs isolated from the tissues that potentially harbor proviral AAV genomes at very low abundance and supplementing with helper virus function (e.g., adenovirus) to trigger and/or boost AAV rep and cap gene transcription in the transfected cell. In some cases, RNA from the transfected cells provides a template for RT-PCR amplification of cDNA and the detection of novel AAVs. In cases where cells are transfected with total cellular DNAs isolated from the tissues that potentially harbor proviral AAV genomes, it is often desirable to supplement the cells with factors that promote AAV gene transcription. For example, the cells may also be infected with a helper virus, such as an Adenovirus or a Herpes Virus. In a specific embodiment, the helper functions are provided by an adenovirus. The adenovirus may be a wild-type adenovirus, and may be of human or non-human origin, preferably non-human primate (NHP) origin. Similarly adenoviruses known to infect non-human animals (e.g., chimpanzees, mouse) may also be employed in the methods of the disclosure (See, e.g., U.S. Pat. No. 6,083,716). In addition to wild-type adenoviruses, recombinant viruses or non-viral vectors (e.g., plasmids, episomes, etc.) carrying the necessary helper functions may be utilized. Such recombinant viruses are known in the art and may be prepared according to published techniques. See, e.g., U.S. Pat. Nos. 5,871,982 and 6,251,677, which describe a hybrid Ad/AAV virus. A variety of adenovirus strains are available from the American Type Culture Collection, Manassas, Va., or available by request from a variety of commercial and institutional sources. Further, the sequences of many such strains are available from a variety of databases including, e.g., PubMed and GenBank.

Cells may also be transfected with a vector (e.g., helper vector) which provides helper functions to the AAV. The vector providing helper functions may provide adenovirus functions, including, e.g., E1a, E1b, E2a, E4ORF6. The sequences of adenovirus gene providing these functions may be obtained from any known adenovirus serotype, such as serotypes 2, 3, 4, 7, 12 and 40, and further including any of the presently identified human types known in the art. Thus, in some embodiments, the methods involve transfecting the cell with a vector expressing one or more genes necessary for AAV replication, AAV gene transcription, and/or AAV packaging.

In some cases, a novel isolated capsid gene can be used to construct and package recombinant AAV vectors, using methods well known in the art, to determine functional characteristics associated with the novel capsid protein encoded by the gene. For example, novel isolated capsid genes can be used to construct and package recombinant AAV (rAAV) vectors comprising a reporter gene (e.g., B-Galactosidase, GFP, Luciferase, etc.). The rAAV vector can then be delivered to an animal (e.g., mouse) and the tissue targeting properties of the novel isolated capsid gene can be determined by examining the expression of the reporter gene in various tissues (e.g., heart, liver, kidneys) of the animal. Other methods for characterizing the novel isolated capsid genes are disclosed herein and still others are well known in the art.

The kit may have a variety of forms, such as a blister pouch, a shrink wrapped pouch, a vacuum sealable pouch, a sealable thermoformed tray, or a similar pouch or tray form, with the accessories loosely packed within the pouch, one or more tubes, containers, a box or a bag. The kit may be sterilized after the accessories are added, thereby allowing the individual accessories in the container to be otherwise unwrapped. The kits can be sterilized using any appropriate sterilization techniques, such as radiation sterilization, heat sterilization, or other sterilization methods known in the art. The kit may also include other components, depending on the specific application, for example, containers, cell media, salts, buffers, reagents, syringes, needles, a fabric, such as gauze, for applying or removing a disinfecting agent, disposable gloves, a support for the agents prior to administration etc.

The instructions included within the kit may involve methods for detecting a latent AAV in a cell. In addition, kits of the disclosure may include, instructions, a negative and/or positive control, containers, diluents and buffers for the sample, sample preparation tubes and a printed or electronic table of reference AAV sequence for sequence comparisons.

The following Examples are meant to illustrate, but in no way limit the claimed invention.

EXAMPLES Example 1: A Novel AAV Capsid Library

Unlike peptide phage display libraries where the tissue resident phages are amplified by infection of bacteria, the multiple rounds of in vivo selection using AAV capsid libraries require PCR amplification of the entire cap gene followed by high-efficiency cloning and re-packaging. Each iteration of this process takes weeks to months. Therefore, we decided to generate a new AAV capsid library of higher diversity and based on a larger number of wild type AAV capsids, namely AAV1, AAV2, AAV4, AAV5, AAV6, AAV8, AAV9, AAVrh8, AAVrh10, AAVrh39, and AAVrh43. In addition to increasing the diversity and complexity of the new library we wanted to also investigate a new approach for in vivo selection of AAV capsid libraries by using unique barcodes in the 3′UTR of each clone in the library and deep sequencing to determine their frequency in target cells/tissue followed by PCR amplification of capsid/barcode pairs from the plasmid library (see FIG. 1).

Generation of Barcoded Capsid Library

Inclusion of unique barcodes in lentivirus-shRNA libraries dramatically accelerated genome-wide screens of gene function and pathway discovery. In these libraries with <100,000 clones, unique barcodes have been used and the relation between barcode and shRNA well characterized by high-throughput sequencing. Moreover a unique set of 240,000 microarray compatible barcodes have been developed. For the present application with a library of 2×10⁷ clones, it was impossible to utilize such approach.

Therefore we used a constrained randomized 30-mer (NNM)io designed to eliminate specific restriction site from the barcode. The number of possible barcodes in this configuration is 1.1×10¹⁵. We synthesized the equivalent of ˜10¹² barcodes and inserted it in the 3′ untranslated region of the chimeric AAV genomes. Given the difference in size between the library (2×10⁷) and the synthesized barcode oligonucleotide (˜10¹²), it is likely that each cap gene is linked to a different barcode. We have produced this new AAV capsid library and PCR amplified AAV genomes from the purified virion stock. Sequencing has shown that each clone has an individual and unrelated barcode (Table 1), and that all amplified AAV genomes are chimeric (FIG. 2). Thus far we have not observed evidence of a bias in the packaged chimeric AAV genomes.

TABLE 1 Exemplary AAV capsid library barcodes Clone Barcodes SEQ Nb. ID NO. S4 GTCTCATGCTATTGCTTCGCATCGTCAGAC 17 S5 GCAGGAGTCTTCGGTGCAGTTTGTTACGCA 18 S6 TGATTGGACTAGTCGTGGGAGTTAGCATGC 19 S9 TTATCTGGCGTGTAGGCAGCGGCCTCAGAC 20 S10 TTATACGACTCATATGTGGCATACGTCGTG 21 S11 GGAGCCGAGGAAGACTTGTCCGGGGACTGG 22 S12 GCCGGGGTGTTGGTCTTTGTCTACGATGGC 23

This one-step in vivo AAV capsid selection approach relies on the ability to analyze simultaneously millions of sequences by deep sequencing and thus assess the frequency of individual bar codes in target cells/tissues. The second component that we will be optimizing throughout the process is how to identify the capsid sequences that correspond to each barcode. It is not feasible to sequence every single clone in a library of 2×10⁷ clones by traditional methods, and deep sequencing of such large DNA molecules would require breaking them into smaller fragments, which would lead to the loss of information on the chimeric nature of these new cap genes. Therefore we will utilize the sequence variation in the barcodes to design PCR-based strategies to amplify the cap gene corresponding to each barcode using the plasmid library as template.

Cell Culture Testing of AAV Capsid Library

We performed experiments in human U87 glioma cells with AAV library doses of 10, 10², 10⁵ genome copies (gc)/target cell. Cells were exposed to the library for 4 hrs. followed by extensive washing and incubation for 3 days. Barcodes were PCR amplified from crude cell lysates using primers specific for constant regions immediately flanking them. The PCR products were cloned into a TOPO-TA cloning plasmid and individual barcodes sequenced for a number of clones for each library dose. We found evidence of clonal dominance for all library doses, but especially at the lowest dose where 24 out of 25 clones were a single barcode (FIG. 3A-3B). The experiment was repeated with similar results of clonal dominance at the lowest library dose, albeit different barcodes were identified. These results suggest that lower AAV library doses may lead to increased stringency and identification of the most infectious clones without deep sequencing.

In Vivo Testing

Next we tested the validity of this principle in wild type mice. For this purpose we infused the AAV library via the tail vein of 6-8 week-old C57BL/6 mice at doses of 1E⁹, 1E¹⁰, 1E¹¹, and 5E¹¹ gc (one mouse per dose). Three days later we collected cerebrum, cerebellum, spinal cord, liver, skeletal muscle and heart from each mouse, and isolated genomic DNA. We have conducted the barcode survey in cerebrum and liver, as these two organs present very different transduction profiles for most gene transfer vectors, including AAV.

In the liver of AAV library-infused mice we found no evidence of barcode dominance at 5E¹¹, but a considerable number of seemingly over-represented clones at 1E⁹ (FIG. 3C, top panel). In contrast, in the brain we were unable to recover any barcodes at 1E⁹, but observed clear over-representation of certain barcodes at 5E¹¹ (FIG. 3C, bottom panel). These results are consistent with the observation that the liver is often the main depot of AAV vector genomes after systemic delivery, while the brain often has only a small percentage of the total dose, if at all. Therefore it appears that the selection stringency in vivo is dependent on library dose and target tissue. Deep sequencing of barcodes may allow identification of over-represented barcodes in all organs for the same library dose.

These results in cell culture and in vivo suggest that our principle of single-round selection/panning may be sufficient to identify novel chimeric AAV capsids with tropism for the targets of choice. These results also suggest that lower AAV doses lead to increased selection and stringency. The presence of barcodes as a diagnostic tag is critical, as the ability to estimate viral diversity and biases by deep sequencing simply does not exist for the earlier published non-barcoded AAV capsid libraries due to the chimeric nature of DNA-shuffled libraries. Deep sequencing of tissue barcode could also help bypass potential biases inherent to low-throughput sequencing screens.

In Vivo Identification of Brain-Targeting Chimeric AAVs from the Barcoded Capsid Library

Traditionally, in-vivo selection of AAV libraries from target tissue has been done by either PCR amplification of full-length chimeric cap genes enriched in target tissues, or by adenoviral rescue. As an alternative strategy to our single-round in-vivo selection strategy using the barcodes as ‘fishing’ tags, we decided to investigate whether we could also recover enriched full-length cap genes from total DNA isolates of target tissues in a single round. An important thing to note here is although this approach does not require the use of barcodes as ‘fishing’ tags, they still serve to distinguish unique capsid sequences from each other; even chimeric capsid genes that differ by a single nucleotide can be readily distinguished from each other by the presence of unique barcodes.

The AAV capsid library generated above was packaged to generate a high titer viral stock and intravenously infused at two doses of 1E11 and 5E11 vg via tail vein into adult C57BL/6 mice. We performed nested PCR amplification of the full-length cap genes (including the barcode region) from total DNA isolates from brain and liver of AAV-library infused C57BL/6 mice (as mentioned earlier), using cap gene-flanking universal primers specific for constant regions across all chimeric AAV cap genes. These caps were cloned using a cloning vector and sequenced. Analogous to the aforementioned barcode dominance, evidence of dosage-dependent cap gene dominance was observed. (FIG. 4).

PCR amplification of library-derived AAV capsids from brain genomic DNA isolated 3 days post infusion resulted in the identification of four chimeric capsids. These were named AAV-B1 (SEQ ID NO: 1 and 5), AAV-B2 (SEQ ID NO: 2 and 6), AAV-B3 (SEQ ID NO: 3 and 7) and AAV-B4 (SEQ ID NO: 4 and 8). Native capsid contributions to AAV-B1, AAV-B2, AAV-B3 and AAV-B4 are shown in FIGS. 19A-FIG. 22D. AAV-B1 was the only capsid isolated from the brain of 1E11 vg infused mouse and AAV-B2, AAV-B3 and AAV-B4 were the three capsids identified from the higher dose. All four capsids isolated from brain share remarkable protein sequence homology (shown in FIG. 16-16B) despite being diverse at the nucleotide level (FIG. 4 and FIG. 15A-15D). This, coupled with the diversity of capsids isolated from the liver of the same mice, suggests selection of functional domains necessary for transducing the brain.

Four chimeric capsids targeting liver were also identified. These were named AAV-L1 (SEQ ID NO: 9 and 13), AAV-L2 (SEQ ID NO: 10 and 14), AAV-L3 (SEQ ID NO: 11 and 15) and AAV-L4 (SEQ ID NO: 12 and 16). All four capsids isolated from brain share remarkable protein sequence homology (shown in FIG. 18A-18B) despite being diverse at the nucleotide level (FIG. 17A-17D).

Our next goal was to determine whether these brain-selected AAV capsids demonstrate brain tropism after systemic intravenous infusion in adult mice. Self-complementary recombinant AAV vectors (rAAV-B1-B4) encoding EGFP were produced using these capsids, and also AAV9 as a control given its exceptional CNS tropism after iv infusion in neonatal and adult mice. rAAV vectors were infused into adult C57BL/6 mice at a dose of 4.6×10¹¹ vector genomes via the tail vein (N=3 mice per rAAV vector). Brain, spinal cord and peripheral tissues were harvested at 4 weeks post-injection and analyzed for GFP expression by immunofluorescence staining.

All newly selected AAV vectors transduced cells in the brain and spinal cord at levels comparable to AAV9 (FIG. 5 and FIG. 6). rAAV-B1-derived rAAV vector transduced the CNS endothelium at high efficiency. rAAV-B2-B4 also transduced brain endothelial cells at high efficiency, but similar to our findings for AAV9 at this dose most transduced cells in the parenchyma appeared to be glia. The degree of glia transduction for rAAV-B2 vector appeared to be considerably superior to that observed with rAAV9 vector. The other three vectors, rAAV-B3 and rAAV-B4 also transduced glia at high efficiency. Interestingly liver transduction was considerably lower for rAAV-B1 and rAAV-B2 compared to rAAV9 (FIG. 5), but higher with rAAV-B3 and rAAV-B4.

The efficient gene transfer in the CNS in the presence of non-compromised blood-brain barrier after systemic iv delivery of DNA shuffled AAV viral vectors, coupled with significantly reduced liver transduction is novel and has not been previously observed, to the best of our knowledge. Furthermore, only one round of selection was shown to be enough for efficient selection.

Higher doses of the two most promising vectors, AAV-B1 and AAV-B2, were used to further study their biodistribution profiles in adult mice. Infusion of a higher dose (2E¹² vg) of rAAV-B1 and rAAV-B2 resulted in transduction of cortical and hippocampal neurons, endothelia and astrocytes (FIG. 7). Importantly the liver transduction efficiency for rAAV-B1 and rAAV-B2 vectors was dramatically lower than AAV9, as observed by fluorescence imaging (FIG. 8).

The biodistribution of rAAV-B1, rAAV-B2, and AAV9 vectors was analyzed by quantitative PCR (qPCR) of vector genomes in genomic DNA from different tissues and GFP expression levels by Western blotting. Analysis was performed 4 weeks post i.v. infusion of scAAV-CBA-GFP vector via the tail vein in adult C57BL/6 mice at a dose of 5E¹¹ vg. Significantly more vector genomes were found in the cerebrum, cerebellum and spinal cord of rAAV-B1, compared to AAV9 (9-fold, 15-fold and 6-fold, respectively) (FIG. 9). However, both rAAV-B1 and rAAV-B2 had significantly lower vector genomes in the liver in comparison to AAV9 (4-fold and 3-fold, respectively). These observations were confirmed by detection of GFP by Western blot analysis of total tissue lysates. More GFP protein was detected in the cerebrum of adult C57BL/6 mice injected with rAAV-B1 and rAAV-B2, compared to AAV9 (FIG. 10). In contrast, less GFP protein was detected in the liver of rAAV-B1 and rAAV-B2-injected mice, in comparison to AAV9.

In addition, rAAV-B2, and to a greater extent, rAAV-B1, transduced peripheral tissues to a greater extent than AAV9 (FIG. 11 and FIG. 12). Significantly more vector genomes were found in the forelimb muscle, hind limb muscle, heart and pancreas of both rAAV-B1 and rAAV-B2-injected mice, as compared to AAV9, with the extent of transduction of rAAV-B1 being greater than rAAV-B2 for each of these tissues. In addition, rAAV-B1 had significantly more vector genomes in kidney and lung in comparison to AAV9.

rAAV-B1-GFP and rAAV-B2-GFP vectors were infused into 4-6 week-old kittens (n=1/vector) via the carotid artery at a dose of 3.5E¹² vg. Neuronal and astrocytic populations (identified by morphology) were found to be transduced in both cat brains (FIG. 13). While rAAV-B2 has pronounced astrocytic transduction, the number of astrocytes transduced is much lower for rAAV-B1. Loci with high density of neurons were found transduced in rAAV-B1 cat brain. No endothelial transduction was observed for either vector in the feline brain.

In Vitro Identification of Cap-Barcode Pairs

Although we have now isolated novel AAV capsids targeted to broader tissues such as the CNS by full-length capsid gene PCR, this traditional method may be limited by the low number of AAV genomes that are capable of crossing the blood brain barrier to transduce neurons at high efficiency. It has been previously shown that libraries derived from PCR of smaller amplicon size contain more unique sequences and higher diversity estimates, than those derived from larger amplicons. Therefore, single-round in-vivo selection strategy of amplifying only the barcodes, instead of the entire capsid gene will thus allow us to amplify even the scarce viral genomes that may result from low frequency events, such as AAV capsids in the library capable of crossing the BBB to transduce neurons at high efficiency. This is supported by our observation that while we could pull barcodes from 1E⁹ and 1E¹⁰ brains, we were unable to isolate full-length capsid genes from brain at doses lower than 1E¹¹.

The ability to link barcodes to their unique capsid genes is key to the success of this novel approach. We now report that we have successfully developed a method to link a barcode to its corresponding capsid gene. Initially, we pulled out rAAV-B1 using its barcode for our screen, as the capsid gene and barcode for the rAAV-B1 clone were fully sequenced and thus we knew the capsid-barcode relation. Using high-density colony hybridization with a radioactively-labeled rAAV-B1 barcode-specific oligo, we were successful in identifying the rAAV-B1 capsid gene after screening a secondary plasmid library generated by PCR amplification of AAV genomes present in the packaged library. Depicted in FIG. 14 is a schematic representation of the process used to identify the rAAV-B1 capsid gene.

Example 2

FIG. 23 shows CNS transduction profile of AAV-B1 vector after intravascular infusion in adult mice. FIG. 23A provides an overview of GFP distribution in brains of AAV-B1 and AAV9 injected mice (2×10¹² vg/mouse). Representative images of coronal brain sections located at +0.5 mm, −1.80 mm and −3.00 mm (left to right) in relation to bregma are shown. Transduction of neuronal populations in different CNS regions of AAV-B1 injected mice (2×10¹² vg/mouse) is shown in FIG. 23B. Black arrows indicate examples of GFP-positive neurons identified by morphology. Bar=50 μm. FIG. 23C shows AAV vector genome content in cerebrum, cerebellum and spinal cords (N=4 animals per group) (5×10¹¹ vg/mouse). Age matched non-injected mice were included as controls (not shown). FIG. 23D shows Western blot analysis of GFP expression in cerebrum, cerebellum and spinal cord of 2 animals per group (5×10¹¹ vg/mouse). Signal intensity of GFP was normalized to corresponding β-actin signal intensity for quantitative comparison. **p<0.01, ***p<0.001, ****p<0.0001 by Student's unpaired t-test.

FIG. 24A-24B shows neuronal transduction in cat after systemic delivery of AAV-B1 vectors. Transduction of neurons in the cat brain after systemic delivery of AAV-B1 vector (3.4×10¹² vg). Representative images show GFP-positive cells with neuronal morphology in various structures in the brain. Bar=50 μm.

FIG. 25 shows AAV-B1 biodistribution to mouse liver and muscle after intravascular delivery. Native GFP expression in (FIG. 25A) livers of AAV-B1 and AAV9 injected mice (2×10¹² vg/mouse) and (FIG. 25D) skeletal muscle (triceps and quadriceps), diaphragm and heart of AAV-B1 injected mice (5×10¹¹ vg/mouse). AAV vector genome content (N=4 animals per group) (FIG. 25B, liver; FIG. 25E muscle groups) and Western blot analysis of GFP expression (N=2 animals per group) (FIG. 25C, liver; FIG. 25F muscle groups) are shown (5×10¹¹ vg/mouse). Signal intensity of GFP was normalized to corresponding β-actin signal intensity for quantitative comparison. *p<0.05, **p<0.01, by Student's unpaired t-test.

FIG. 26 shows transduction of mouse beta cells, alveolar epithelium and retinal vasculature after intravenous infusion of AAV-B1. GFP expression in FIG. 26A pancreas, FIG. 26D lung, and FIG. 26G retina of AAV-B1 injected mice (5×10¹¹ vg/mouse). White arrow in FIG. 26A indicates GFP-positive insulin-producing beta cells. Inset in FIG. 26G shows individual GFP-positive blood vessels. AAV vector genome content (N=4 animals per group) (FIG. 26B, pancreas; FIG. 26E lung) and FIG. 26C. Western blot analysis of GFP expression (N=2 animals per group) (FIG. 26C, pancreas; FIG. 26E lung) are shown. Signal intensity of GFP was normalized to corresponding β-actin signal intensity for quantitative comparison. *p<0.05, ****p<0.0001 by Student's unpaired t-test.

FIG. 27 shows biophysical characterization of AAV-B1. FIG. 27A shows a predicted molecular model of AAV-B1 capsid. FIG. 27B shows a surface exposed variable region-IV (VR-IV) of AAV-B1 (left) and AAV8 (right). FIG. 27C shows pooled human IVIg neutralization assay and FIG. 27D shows a CHO cell binding assay of AAV-B1 and AAV9 vectors. Data shown as mean±SEM in FIG. 27C, and as mean±SD in FIG. 27D. Experiment was performed with N=3 biological replicates. *p<0.05 by one-way ANOVA.

FIG. 28 provides a schematic cartoon of one embodiment of a single round in-vivo biopanning strategy

FIG. 29 shows the chimeric nature of packaged viral library. FIG. 30A shows parental capsid contribution to clones 1-5 isolated at random from packaged viral library. FIG. 30B shows homology between the clones at the amino acid level. Grey areas indicate homology; black lines indicate non-homologous amino acids. % homology is calculated for amino acid composition.

FIG. 30 shows the chimeric and homologous nature of brain-resident capsids. Parental capsid contribution to (FIG. 30A) brain-selected capsids AAV-B1, -B2, -B3 and -B4, and (FIG. 30B) 4 liver-resident variants chosen at random. Homology among (FIG. 30C) brain and (FIG. 30D) liver clones at the amino acid level. Grey areas indicate homology; black lines indicate non-homologous amino acids. % homology is calculated for amino acid composition.

FIG. 31 shows a transduction profile of AAV-B1 vector across multiple CNS regions after systemic delivery. Black arrows indicate examples of GFP-positive neurons. Bar=50 μm.

FIG. 32 shows phenotyping of GFP positive cells in CNS after systemic delivery of AAV-B1. Transduced cells were identified by double immunofluorescence staining with antibodies to GFP, pan-neuronal marker NeuN (FIG. 32A, FIG. 32B and FIG. 32D), striatal medium spiny neuron marker DARPP32 (FIG. 32A), dopaminergic neuron marker tyrosine hydroxylase (TH) (FIG. 32B), Purkinje neuron marker calbindin-D-28k (Cal28K) (FIG. 32C), endothelial marker CD31 (FIG. 32E), and mature oligodendrocyte marker APC (FIG. 32F). The large size, morphology and location of GFP-positive neurons in the ventral spinal cord suggest a motor neuron identity. GFP-positive astrocytes (FIG. 32G) were identified based on their morphology. White arrows indicate examples of co-localization. Bar=10 μm.

FIG. 33A-33E shows a comparison of AAV-B1 capsid protein sequence to AAV8 and other natural AAV isolates. The top two lines highlight the similarities and differences between amino acid sequences of AAV8 and AAV-B1, with singleton residue variants relative to AAV8 highlighted in red. The residue highlighted in green indicates a variant residue with no corresponding orthologs in the nine AAV species presented in the alignment. Translation start sites for VP1, VP2, and VP3 are indicated with filled triangles. The conserved parvovirus phospholipase A2 domain (approximately residues 44 to 104) in the VP1 unique region with the conserved AAV calcium-binding motif (Y-X-G-P-G/F) and catalytic residues (H-D-X-X-Y) are indicated with filled rectangles. The secondary structural elements are labeled with the corresponding text and the following symbols: β sheets B, C, D, E, F, G, and I are indicated with a horizontal overlined arrow. The positions of the variable loops (VR), I through IX, are indicated. The position of the conserved α helix is indicated with three parallel horizontal lines.

FIG. 34 shows a comparison of biodistribution profile of AAV-B1 and AAV8 after systemic delivery (5×10¹¹ vg/mouse). AAV vector genome content (N=4 animals per group) in (FIG. 34A) CNS, (FIG. 34B) liver and (FIG. 34C) skeletal muscle (quadriceps) is shown. *p<0.05, **p<0.01, ***p<0.001, ****p<0.0001 by Student's unpaired t-test.

FIG. 35 shows biodistribution profile of AAV-B1 systemically infused at lower dose (5×10¹⁰ vg/mouse). AAV vector genome content (N=4 animals per group) in (FIG. 35A) CNS, (FIG. 35B) muscle groups and (FIG. 35C) peripheral tissues is shown. *p<0.05, **p<0.01, by Student's unpaired t-test.

SEQUENCE LISTING >SEQ ID NO: 1; AAV-B1 ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATT CGCGAGTGGTGGGACTTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAA AGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTC AACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGC ACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTAT AACCACGCCGACGTCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGG CAACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGAGGGTTCTCGAACCTTTTGGTC TGGTTGAGGAAGGCGCTAAGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCAGTC GCCACAAGAGCCAGACTCCTCCTCGGGCATCGGCAAGAAAGGCCAACAGCCCGCCA GAAAAAGACTCAATTTTGGCCAGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAA CCTCTCGGAGAACCTCCAGCAGCGCCCTCTGGTGTGGGACCTAATACAATGGCTGC AGGCGGTGGCGCACCAATGGCAGACAATAACGAGGGCGCCGACGGAGTGGGTAAT TCCTCGGGAAATTGGCATTGCGATTCCACATGGATGGGGGACAGAGTCATCACCAC CAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAGCAAATCT CCAACGGCACCTCGGGAGGAAGCACCAACGACAACACCTATTTTGGCTACAGCACC CCCTGGGGGTATTTTGATTTCAACAGATTCCACTGCCACTTCTCACCACGTGACTGG CAGCGACTCATCAACAACAACTGGGGATTCCGGCCAAAAAGACTCAGCTTCAAGCT CTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATCGCCA ATAACCTTACCAGCACGATTCAGGTATTTACGGACTCGGAATACCAGCTGCCATAC GTCCTCGGCTCCGCGCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTCTTCATG ATTCCCCAGTACGGCTACCTTACACTGAACAATGGAAGTCAAGCCGTAGGCCGTTC CTCCTTCTACTGCCTGGAATATTTTCCTTCTCAAATGCTGAGAACGGGCAACAACTT TGAGTTCAGCTACCAGTTTGAGGACGTGCCTTTTCACAGCAGCTACGCGCACAGCCA AAGCCTGGACCGGCTGATGAACCCCCTCATCGACCAGTACCTGTACTACCTGTCTCG GACTCAGTCCACGGGAGGTACCGCAAGAACTCAGCAGTTGCTATTTTCTCAGGCCG GGCCTAATACAATGGCCAATCAGGCAAAGAACTGGCTGCCAGGACCCTGTTACCGC CAACAACGCGTCTCAACGACAACCGGGCAAAACAACAATAGCAACTCTGCCTGGAC TGCTGGGACCAAATACCATCTGAATGGAAGAAATTCATTGGCTAATCCTGGCATCG CTATGGCAACACACAAAGACGACGAGGAGCGTTTTTTTCCCAGTAACGGGATCCTG ATTTTTGGCAAACAAAATGCTGCCAGAGACAATGCGGATTACAGCGATGTCATGCT CACCAGCGAGGAAGAAATCAAAACCACTAACCCTGTGGCTACAGAGGAATACGGT ATCGTGGCAGATAACTTGCAGCAGCAAAACACGGCTCCTCAAATTGGAACTGTCAA CAGCCAGGGGGCCTTACCCGGTATGGTCTGGCAGAACCGGGACGTGTACCTGCAGG GTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCACCCATCTCCGCTGA TGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCAAGAACACGCCTG TACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAACTCTTTCATCACGC AATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTACAGAAGGAAAA CAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAATCTACAA GTGTGGACTTTGCTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGGCA CCCGTTACCTCACCCGTAATCTGTAA >SEQ ID NO: 2; AAV-B2 ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATT CGCGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAA AGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTC AACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGC ACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTAT AACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGG CAACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTC TGGTTGAGGACGGCGCTAAGACGGCTCCTGGAAAGAAGAGACCAGTAGAGCAGTC ACCCCAAGAACCAGACTCCTCCTCGGGCATCGGCAAGAAAGGCCAACAGCCCGCCA GAAAAAGACTCAATTTTGGCCAGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAA CCTCTCGGAGAACCTCCAGCAACCCCCGCTGCTGTGGGACCTACTACAATGGCTTCA GGCGGTGGCGCACCAATGGCAGACAATAACGAAGGCGCCGACGGAGTGGGTAGTT CCTCGGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGAGTCATCACCACC AGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAGCAAATCTC CAACGGGACATCGGGAGGAGCCACCAACGACAACACCTACTTCGGCTACAGCACCC CCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACCACGTGACTGGC AGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCTTCAAGCTC TTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATCGCCAA TAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTACGT TCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGAT TCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCT CCTTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCC AGTTTACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGA GCTTGGACCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTGTCTCGGA CTCAAACAACAGGAGGCACGGCAAATACGCAGACTCTGGGCTTCAGCCAAGGTGG GCCTAATACAATGGCCAATCAGGCAAAGAACTGGCTGCCAGGACCCTGTTACCGCC AACAACGCGTCTCAACGACAACCGGGCAAAACAACAATAGCAACTTTGCCTGGACT GCTGGGACCAAATACCATCTGAATGGAAGAAATTCATTGGCTAATCCTGGCATCGC TATGGCAACACACAAGGACGACGAGGAGCGTTTTTTTCCCAGTAACGGGATCCTGA TTTTTGGCAAACAAAATGCTGCCAGAGACAATGCGGATTACAGCGATGTCATGCTC ACCAGCGAGGAAGAAATCAAAACCACTAACCCTGTGGCTACAGAGGAATACGGTA TCGTGGCAGATAACTTGCAGCAGCAAAACACGGCTCCTCAAATTGGAACTGTCAAC AGCCAGGGGGCCTTACCCGGTATGGTCTGGCAGAACCGGGACGTGTACCTGCAGGG TCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCACCCGTCTCCGCTGAT GGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCAAGAACACGCCTGT ACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAACTCTTTCATCACGCA ATACAGCACCGGACAGGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAAGAAAAC AGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAATCTACAAA TGTGGACTTTGCTGTCAACACGGAGGGGGTTTATAGCGAGCCTCGCCCCATTGGCAC CCGTTACCTCACCCGCAACCTGTAA >SEQ ID NO: 3; AAV-B3 ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATT CGCGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAA AGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTC AACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGC ACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTAT AACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGG CAACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTC TGGTTGAGGAAGGCGCTAAGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCAGTC GCCACAAGAGCCAGACTCCTCCTCGGGCATCGGCAAGACAGGCCAGCAGCCCGCTA AAAAGAGGCTCAATTTTGGTCAGACTGGCGACTCAGAGTCAGTCCCCGACCCACAA CCTCTCGGAGAACCTCCAGCAACCCCCGCTGCTGTGGGACCTACTACAATGGCTTCA GGCGGTGGCGCACCAATGGCAGACAATAACGAAGGCGCCGACGGAGTGGGTAATG CCTCAGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGANTCATCACCACC AGCACCCGAACATGGGCCTTGCCCACCTATAACAACCACCTCTACAAGCAAATCTC CAGTGCTTCAACGGGGGCCAGCAACGACAACCACTACTTCGGCTACAGCACCCCCT GGGGGTATTTTGATTTCAACAGATTCCACTGCCATTTCTCACCACGTGACTGGCAGC GACTCATCAACAACAATTGGGGATTCCGGCCCAAGAGACTCAACTTCAAGCTCTTC AACATCCAAGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATCGCCAATA ACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTACGTTC TCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGATTC CCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC TTCTACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGAAACAATTTTGAA TTCAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAG CTTGGACCGACTGATGAATCCTCTCATCGACCAGTACCTGTACTACTTGTCTCGGAC TCAAACAACAGGAGGCACGGCAAATACGCAGACTCTGGGCTTCAGCCAAGGTGGG CCTAATACAATGGCCAATCAGGCAAAGAACTGGCTGCCAGGACCCTGTTACCGCCA ACAACGCGTCTCAACGACAACCGGGCAAAACAACAATAGCAACTTTGCCTGGACTG CTGGGACCAAATACCATCTGAATGGAAGAAATTCATTGGCTAATCCTGGCATCGCT ATGGCAACACACAAAGACGACGAGGAGCGTTTTTTTCCCAGTAACGGGATCCTGAT TTTTGGCAAACAAAATGCTGCCAGAGACAATGCGGATTACAGCGATGTCATGCTCA CCAGCGAGGAAGAAATCAAAACCACTAACCCTGTGGCTACAGAGGAATACGGTATC GTGGCAGATAACTTGCAGCAGCAAAACACGGCTCCTCAAATTGGAACTGTCAACAG CCAGGGGACCTTACCCGGTATGGTCTGGCAGAACCGGGACGTGTACCTGCAGGGTC CCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCACCCGTCTCCGCTGATGG GCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCAAGAACACGCCTGTAC CTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAACTCTTTCATCACGCAAT ACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTACAGAAGGAAAACAG CAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAATTACTACAAATCTACAAGTG TGGACTTTGCTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGGCACGC GTTTCCTCACCCGTAATCTGTAA >SEQ ID NO: 4; AAV-B4 ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATT CGCGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAA AGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTC AACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGC ACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTAT AACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGG CAACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTC TGGTTGAGGAAGGCGCTAAGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCAGTC GCCACAAGAGCCAGACTCCTCCTCGGGCATCGGCAAGACAGGCCAGCAGCCCGCTA AAAAGAGGCTCAATTTTGGCCAGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAA CCTCTCGGAGAACCTCCAGCAGCGCCCTCTGGTGTGGGACCTAATACAATGGCTGC AGGCGGTGGCGCACCAATGGCAGACAATAACGAAGGCGCCGACGGAGTGGGTAGT TCCTCGGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGAGTCATCACCAC CAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAGCAAATCT CCAACGGGACATCGGGAGGAGCCACCAACGACAACACCTACTTCGGCTACAGCACC CCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACCACGTGACTGG CAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCTTCAAGCT CTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATCGCCA ATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTAC GTACTAGGATCCGCTCACCAGGGATGTCTGCCTCCGTTCCCGGCGGACGTGTTCATG ATTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTC CTCCTTCTACTGCCTGGAATATTTTCCATCTCAGATGCTGAGAACGGGCAATAACTT TACCTTCAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTATGCCCACAGCCA GAGCTTGGACCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTGTCTCG GACTCAAACAACAGGAGGCACGGCAAATACGCAGACTCTGGGCTTCAGCCAAGGT GGGCCTAATACAATGGCCAATCAGGCAAAGAACTGGCTGCCAGGACCCTGTTACCG CCAACAACGCGTCTCAACGACAACCGGGCAAAACAACAATAGCAACTTTGCCTGGA CTGCTGGGACCAAATACCATCTGAATGGAAGAAATTCATTGGCTAATCCTGGCATC GCTATGGCAACACACAAAGACGACGAGGAGCGTTTTTTTCCCAGTAACGGGATCCT GATTTTTGGCAAACAAAATGCTGCCAGAGACAATGCGGATTACAGCGATGTCATGC TCACCAGCGAGGAAGAAATCAAAACCACTAACCCTGTGGCTACAGAGGAATACGGT ATCGTGGCAGATAACTTGCAGCAGCAAAACACGGCTCCTCAAATTGGAACTGTCAA CAGCCAGGGGGCCTTACCCGGTATGGTCTGGCAGAACCGGGACGTGTACCTGCAGG GTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCACCCGTCTCCGCTAA TGGGAGGATTTGGACTGAAGCACCCACCTCCTCAGATCCTGATCAAGAACACGCCG GTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGTACTCTTTCATCACG CAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGAAGGAAA ACAGCAAGCGCTGGAACCCCGAGATCCAATACACCTCCAACTACTACAAATCTACA AGTGTGGACTTTGCTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGGC ACCCGTTACCTCACCCGTAATCTGTAA >SEQ ID NO: 5; AAV-B1 MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPF NGLDKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADVEFQERLQEDTSFGG NLGRAVFQAKKRVLEPFGLVEEGAKTAPGKKRPVEQSPQEPDSSSGIGKKGQQPARKR LNFGQTGDSESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGNSSGN WHCDSTWMGDRVITTSTRTWALPTYNNHLYKQISNGTSGGSTNDNTYFGYSTPWGYF DFNRFHCHFSPRDWQRLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQ VFTDSEYQLPYVLGSAHQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPS QMLRTGNNFEFSYQFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTARTQQ LLFSQAGPNTMANQAKNWLPGPCYRQQRVSTTTGQNNNSNSAWTAGTKYHLNGRNS LANPGIAMATHKDDEERFFPSNGILIFGKQNAARDNADYSDVMLTSEEEIKTTNPVATE EYGIVADNLQQQNTAPQIGTVNSQGALPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSP LMGGFGLKHPPPQILIKNTPVPADPPTTFNQSKLNSFITQYSTGQVSVEIEWELQKENSKR WNPEIQYTSNYYKSTSVDFAVNTEGVYSEPRPIGTRYLTRNL >SEQ ID NO: 6; AAV-B2 MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPF NGLDKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGG NLGRAVFQAKKRVLEPLGLVEDGAKTAPGKKRPVEQSPQEPDSSSGIGKKGQQPARKR LNFGQTGDSESVPDPQPLGEPPATPAAVGPTTMASGGGAPMADNNEGADGVGSSSGN WHCDSTWLGDRVITTSTRTWALPTYNNHLYKQISNGTSGGATNDNTYFGYSTPWGYF DFNRFHCHFSPRDWQRLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQ VFTDSEYQLPYVLGSAHQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPS QMLRTGNNFQFTYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTQTTGGTANTQT LGFSQGGPNTMANQAKNWLPGPCYRQQRVSTTTGQNNNSNFAWTAGTKYHLNGRNS LANPGIAMATHKDDEERFFPSNGILIFGKQNAARDNADYSDVMLTSEEEIKTTNPVATE EYGIVADNLQQQNTAPQIGTVNSQGALPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSP LMGGFGLKHPPPQILIKNTPVPADPPTTFNQSKLNSFITQYSTGQVSVEIEWELQKENSKR WNPEIQYTSNYYKSTNVDFAVNTEGVYSEPRPIGTRYLTRNL >SEQ ID NO: 7; AAV-B3 MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPF NGLDKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGG NLGRAVFQAKKRVLEPLGLVEEGAKTAPGKKRPVEQSPQEPDSSSGIGKTGQQPAKKR LNFGQTGDSESVPDPQPLGEPPATPAAVGPTTMASGGGAPMADNNEGADGVGNASGN WHCDSTWLGDRXITTSTRTWALPTYNNHLYKQISSASTGASNDNHYFGYSTPWGYFDF NRFHCHFSPRDWQRLINNNWGFRPKRLNFKLFNIQVKEVTQNEGTKTIANNLTSTIQVF TDSEYQLPYVLGSAHQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQM LRTGNNFEFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTQTTGGTANTQTLG FSQGGPNTMANQAKNWLPGPCYRQQRVSTTTGQNNNSNFAWTAGTKYHLNGRNSLA NPGIAMATHKDDEERFFPSNGILIFGKQNAARDNADYSDVMLTSEEEIKTTNPVATEEY GIVADNLQQQNTAPQIGTVNSQGTLPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLM GGFGLKHPPPQILIKNTPVPADPPTTFNQSKLNSFITQYSTGQVSVEIEWELQKENSKRW NPEIQYTSNYYKSTSVDFAVNTEGVYSEPRPIGTRFLTRNL >SEQ ID NO: 8; AAV-B4 MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPF NGLDKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGG NLGRAVFQAKKRVLEPLGLVEEGAKTAPGKKRPVEQSPQEPDSSSGIGKTGQQPAKKR LNFGQTGDSESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGN WHCDSTWLGDRVITTSTRTWALPTYNNHLYKQISNGTSGGATNDNTYFGYSTPWGYF DFNRFHCHFSPRDWQRLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQ VFTDSEYQLPYVLGSAHQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPS QMLRTGNNFTFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTQTTGGTANTQT LGFSQGGPNTMANQAKNWLPGPCYRQQRVSTTTGQNNNSNFAWTAGTKYHLNGRNS LANPGIAMATHKDDEERFFPSNGILIFGKQNAARDNADYSDVMLTSEEEIKTTNPVATE EYGIVADNLQQQNTAPQIGTVNSQGALPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSP LMGGFGLKHPPPQILIKNTPVPADPPTTFNQSKLYSFITQYSTGQVSVEIEWELQKENSKR WNPEIQYTSNYYKSTSVDFAVNTEGVYSEPRPIGTRYLTRNL >SEQ ID NO: 9; AAV-L1 ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATT CGCGAGTGGTGGGACTTGAAACCTGGAGCCCCGAAACCCAAAGCCAACCAGCAAA AGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTC AACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGATGCAGCGGCCCTCGAGC ACGACAAAGCCTACGACCGGCAGCTCGACAGCGGAGACAACCCGTACCTCAAGTAC AACCACGCCGACGCGGAGTTTCAGGAGCGCCTTAAAGAAGATACGTCTTTTGGGGG CAACCTCGGACGAGCAGTCTTCCAGGCCAAGAAGAGGGTTCTCGAACCTTTTGGTC TGGTTGAGGAAGGTGCTAAGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCAGTCG CCACAAGAGCCAGACTCCTCCTCGGGCATTGGCAAGACAGGCCAGCAGCCCGCTAA AAAGAGACTCAATTTTGGTCAGACTGGCGACTCAGAGTCAGTCCCCGACCCACAAC CTCTCGGAGAACCTCCAGCAACCCCCGCTGCTGTGGGACCTACTACAATGGCTTCAG GTGGTGGCGCACCAGTGGCAGACAATAACGAAGGCGCCGACGGAGTGGGTAGTTC CTCGGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGAGTCATCACCACCA GCACCCGAACATGGGCCTTGCCCACCTATAACAACCACCTCTACAAGCAAATCTCC AGTGCTTCGACGGGGGCCAGCAACGACAACCACTACTTCGGCTACAGCACCCCCTG GGGGTATTTTGATTTCAACAGGTTCCACTGCCATTTCTCACCACGTGACTGGCAGCG ACTCATCAACAACAATTGGGGATTCCGGCCCAAAAGACTCAACTTCAAGCTGTTCA ACATCCAGGTCAAGGAAGTCACGACGAACGAAGGCACCAAGACCATCGCCAATAA TCTCACCAGCACCGTGCAGGTCTTTACGGACTCGGAGTACCAGTTACCGTACGTGCT AGGATCCGCTCACCAGGGATGTCTGCCTCCGTTCCCGGCGGACGTCTTCATGGTTCC TCAGTACGGCTATTTAACTTTAAACAATGGAAGCCAAGCCCTGGGACGTTCCTCCTT CTACTGTCTGGAGTATTTCCCATCGCAGATGCTGAGAACGGGCAACAACTTTACCTT TAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCGCACAGCCAGAGCC TGGACAGGCTGATGAATCCCCTCATCGACCAGTACCTGTACTACCTGGTCAGAACG CAAACGACTGGAACTGGAGGGACGCAGACTCTGGCATTCAGCCAAGCGGGTCCTAG CTCAATGGCCAACCAGGCTAGAAATTGGGTGCCCGGACCTTGCTACCGGCAGCAGC GCGTCTCCACGACAACCAACCAGAACAACAACAGCAACTTTGCCTGGACGGGAGCT GCCAAGTTTAAGCTGAACGGCCGAGACTCTCTAATGAATCCGGGCGTGGCAATGGC TTCCCACAAGGATGACGACGACCGCTTCTTCCCTTCGAGCGGGGTCCTGATTTTTGG CAAGCAAGGAGCCGGGAACGATGGAGTGGATTACAGCCAAGTGCTGATTACAGAT GAGGAAGAAATCAAGGCTACCAACCCCGTGGCCACAGAAGAATATGGAGCAGTGG CCATCAACAACCAGGCCGCCAATACGCAGGCGCAGACCGGACTCGTGCACAACCAG GGGGTGATTCCCGGCATGGTGTGGCAGAATAGAGACGTGTACCTGCAGGGTCCCAT CTGGGCCAAAATTCCTCACACGGACGGCAACTTTCACCCGTCTCCCCTGATGGGCGG CTTTGGACTGAAGCACCCGCCTCCTCAAATTCTCATCAAGAACACACCGGTTCCAGC GGACCCGCCGCTTACCTTCAACCAGGCCAAGCTGAACTCTTTCATCACGCAGTACAG CACCGGACAGGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAAGAAAACAGCAAA CGCTGGAATCCAGAGATTCAATACACTTCCAACTACTACAAATCTACAAATGTGGA CTTTGCTGTCAACACGGAGGGGGTTTATAGCGAGCCTCGCCCCATTGGCACCCGTTA CCTCACCCGCAACCTGTAA >SEQ ID NO: 10; AAV-L2 ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACACTCTCTCTGAAGGAATA AGACAGTGGTGGAAGCTCAAACCTGGCCCACCACCACCAAAGCCCGCAGAGCGGC ATAAGGACGACAGCAGGGGTCTTGTGCTTCCTGGCTACAAGTACCTCGGACCCTTC AACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGACCCTCGAGC ACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTAT AACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGG CNACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTC TGGTTGAGGAAGCTGCTAAGACGGCTCCTGGAAAGAAGAGACCAGTAGAGCCATC ACCCCAGCGTTCTCCAGACTCCTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCG CCAGAAAAAGACTCAATTTTGGTCAGACTGGCGACTCAGAGTCAGTTCCAGACCCT CAACCTCTCGGAGAACCTCCAGCAGCGCCCTCTGGTGTGGGACCTAATACAATGGC TGCAGGCGGTGGCGCACCAATGGCAGACAATAACGAAGGCGCCGACGGAGTGGGT AGTTCCTCGGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGAGTCATCAC CACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAGCAAA TCTCCAGTGCTTCAACGGGGGCCAGCAACGACAACCACTACTTCGGCTACAGCACC CCCTGGGGGTATTTTGATTTCAACAGATTCCACTGCCATTTCTCACCACGTGACTGG CAGCGACTCATCAACAACAATTGGGGATTCCGGCCCAAGAGACTCAACTTCAAGCT CTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCATCGCTA ATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCAGACTATCAGCTCCCGTACG TGCTCGGGTCGGCTCACGAGGGCTGCCTCCCGCCGTTCCCAGCGGACGTTTTCATGA TTCCTCAGTACGGCTATTTAACTTTAAACAATGGAAGCCAAGCCCTGGGACGTTCCT CCTTCTACTGTCTGGAGTATTTCCCATCGCAGATGCTGAGAACCGGCAACAACTTTC AGTTCAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCGCACAGCCAG AGCCTGGACAGGCTGATGAATCCCCTCATCGACCAGTACCTGTACTACCTGGTCAG AACGCAAACGACTGGAACTGGAGGGACGCAGACTCTGGCATTCAGCCAAGCGGGT CCTAGCTCAATGGCCAACCAGGCTAGAAATTGGGTGCCCGGACCTTGCTACCGGCA GCAGCGCGTCTCCACGACAACCAACCAGAACAACAACAGCAACTTTGCCTGGACGG GAGCTGCCAAGTTTAAGCTGAACGGCCGAGACTCTCTAATGAATCCGGGCGTGGCA ATGGCTTCCCACAAGGATGACGACGACCGCTTCTTCCCTTCGAGCGGGGTCCTGATT TTTGGCAAGCAAGGAGCCGGGAACGATGGAGTGGATTACAGCCAAGTGCTGATTAC AGATGAGGAAGAAATCAAGGCTACCAACCCCGTAGCCACAGAAGAATATGGAGCA GTGGCCATCAACAACCAGGCCGCCAATACGCAGGCGCAGACCGGACTCGTGCACAA CCAGGGGGTGATTCCCGGAATGGTGTGGCAAGACAGAGACGTATACCTGCAGGGTC CTATTTGGGCCAAAATTCCTCACACGGATGGACACTTTCACCCGTCTCCTCTCATGG GCGGCTTTGGACTTAAGCACCCGCCTCCTCAGATCCTCATCAAAAACACGCCTGTTC CTGCGAATCCTCCGGCAGAGTTTTCGGCTACAAAGTTTGCTTCATTCATCACCCAGT ATTCCACAGGACAAGTGAGCGTGGAGATTGAATGGGAGCTGCAGAAAGAAAACAG CAAACGCTGGAATCCCGAAGTGCAGTATACATCTAACTATGCAAAATCTGCCAACG TTGATTTCACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGCACCC GTTACCTCACCCGTCCCCTGTAA >SEQ ID NO: 11; AAV-L3 ATGACTGACGGTTACCTTCCAGATTGGCTAGAGGACAACCTCTCTGAAGGCGTTCG AGAGTGGTGGGCGCTGCAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAG CAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAA CGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCAC GACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAA CCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCA NCCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTG GTTGAGGAAGGCGCTAAGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCCATCACC CCAGCGTTCTCCAGACTCCTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCA GAAAAAGACTCAATTTTGGTCAGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAA CCTCTCGGAGAACCTCCAGCAGCGCCCTCAGGTGTGGGATCTCTTACAATGGCTTCA GGTGGTGGCGCACCAGTGGCAGACAATAACGAAGGTGCCGATGGAGTGGGTAGTTC CTCGGGAAATTGGCATTGCGATTCCCAATGGCTGGGGGACAGAGTCATCACCACCA GCACCCGAACCTGGGCCCTGCCCACCTACAACAATCACCTCTACAAGCAAATCTCC AACAGCACATCTGGAGGATCTTCAAATGACAACGCCTACTTCGGCTACAGCACCCC CTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTCTCACCACGTGACTGGCA GCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCTTCAAGCTCT TCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATCGCCAAT AACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTACGTT CTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGATT CCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTC CTTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCA GTTTACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAG CTTGGACCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTGTCTCGGAC TCAAACAACAGGAGGCACGGCAAATACGCAGACTCTGGGCTTCAGCCAAGGTGGG CCTAATACAATGGCCAATCAGGCAAAGAACTGGCTGCCAGGACCCTGTTACCGCCA ACAACGCGTCTCAACGACAACCGGGCAAAACAACAATAGCAACTTTGCCTGGACTG CTGGGACCAAATACCATCTGAATGGAAGAAATTCATTGGCTAATCCTGGCATCGCT ATGGCAACACACAAAGACGACGAGGAGCGTTTTTTTCCCAGTAACGGGATCCTGAT TTTTGGCAAACAAAATGCTGCCAGAGACAATGCGGATTACAGCGATGTCATGCTCA CCAGCGAGGAAGAAATCAAAACCACTAACCCTGTGGCTACAGAGGAATACGGTATC GTGGCAGATAACTTGCAGCAGCAAAACACGGCTCCTCAAATTGGAACTGTCAACAG CCAGGGGGCCTTACCCGGTATGGTCTGGCAGAACCGGGACGTGTACCTGCAGGGTC CCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCACCCGTCTCCGCTGATGG GCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCAAGAACACGCCTGTAC CTGCGGATCCTCCGACCACCTTAAACCAGGCCAAGCTGAACTCTTTCATCACGCAAT ACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGAAGGAAAACAG CAAGCGCTGGAACCCAGAGATTCAGTACACTTCAAACTACTACAAATCTACAAATG TGGACTTTGCTGTCAATACAGAGGGAACTTATAGTGAACCCCGCCCCATTGGCACC AGATTTCTGACTCGTAATCTGTAA >SEQ ID NO: 12; AAV-L4 ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACACTCTCTCTGAAGGAATA AGACAGTGGTGGAAGCTCAAACCTGGCCCACCACCACCAAAGCCCGCAGAGCGGC ATAAGGACGACAGCAGGGGTCTTGTGCTTCCTGGGTACAAGTACCTCGGCCCCTTC AACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGC ACGACAAGGCCTACGACCAGCAGCTCGAAGCGGGTGACAATCCGTACCTGCGGTAT AACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGG CANCCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTC TGGTTGAGGAAGGCGCTAAGACGGCTCCTGGAAAGAAGAGACCAGTAGAGCAGTC ACCCCAAGAACCAGACTCCTCCTCGGGCATCGGCAGGAAAGGCCAACAGCCCGCCA GAAAAAGACTCAATTTTGGCCAGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAA CCTCTCGGAGAACCTCCAGCAGCGCCCTCTGGTGTGGGACCTAATACAATGGCTTCA GGCGGTGGCGCTCCAATGGCAGACAATAACGAAGGCGCCGACGGAGTGGGTAATG CCTCAGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGAGTCATCACCACC AGCACCCGCACCTGGGCCTTGCCCACCTACAATAACCACCTCTACAAGCAAATCTCC AGTGCTTCAACGGGGGCCAGCAACGACAACCACTACTTCGGCTACAGCACCCCCTG GGGGTATTTTGATTTCAACAGATTCCACTGCCACTTTTCACCACGTGACTGGCAGCG ACTCATCAACAACAATTGGGGATTCCGGCCCAAGAGACTCAACTTCAAACTCTTCA ACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACTAAGACCATCGCCAATAA CCTTACCAGCACGATTCAGGTATTTACGGACTCGGAATACCAGCTGCCGTACGTCCT CGGCTCCGCGCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTCTTCATGATTCC CCAGTACGGCTACCTTACACTGAACAATGGAAGTCAAGCCGTAGGCCGTTCCTCCTT CTACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGCAACAACTTCCAGTT TACCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTT GGACCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTGTCTCGGACTCA GTCCACAGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGC AAACATGTCGGCTCAGGCTAAGAACTGGCTACCTGGACCTTGCTACCGGCAGCAGC GAGTCTCTACGACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCC ACCAAATATCACCTGAACGGAAGAGACTCTTTGGTAAATCCCGGTGTCGCCATGGC AACCCACAAGGACGACGAGGAACGCTTCTTCCCGTCGAGTGGAGTCCTGATGTTTG GAAAACAGGGTGCTGGAAGAGACAATGTGGACTACAGCAGCGTTATGCTAACCAG CGAAGAAGAAATTAAAACCACTAACCCTGTAGCCACAGAACAATACGGTGTGGTGG TTGATAACTTGCAGCAAACCAATACGGGGCCTATTGTGGGAAATGTCAACAGCCAA GGAGCCTTACCTGGTATGGTCTGGCAGAACCGGGACGTGTACCTGCAGGGTCCCAT CTGGGCCAAGATTCCTCACACGGACGGCAACTTCCACCCGTCTCCGCTGATGGGCG GCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTCATCAAAAACACACCTGTACCTG CGGATCCTCCAACGGCCTTCAACAAGGACAAGCTGAACTCTTTCATCACCCAGTATT CTACTGGCCAAGTCAGCGTGGAGATCGAGTGGGAGCTGCAGAAGGAAAACAGCAA GCGCTGGAACCCGGAGATCCAGTACACTTCCAACTATTACAAGTCTAATAATGTTG AATTTGCTGTTAATACTGAAGGTGTATATAGTGAACCCCGCCCCATTGGCACCAGAT ACCTGACTCGTAATCTGTAA >SEQ ID NO: 13; AAV-L1 MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPF NGLDKGEPVNAADAAALEHDKAYDRQLDSGDNPYLKYNHADAEFQERLKEDTSFGG NLGRAVFQAKKRVLEPFGLVEEGAKTAPGKKRPVEQSPQEPDSSSGIGKTGQQPAKKRL NFGQTGDSESVPDPQPLGEPPATPAAVGPTTMASGGGAPVADNNEGADGVGSSSGNW HCDSTWLGDRVITTSTRTWALPTYNNHLYKQISSASTGASNDNHYFGYSTPWGYFDFN RFHCHFSPRDWQRLINNNWGFRPKRLNFKLFNIQVKEVTTNEGTKTIANNLTSTVQVFT DSEYQLPYVLGSAHQGCLPPFPADVFMVPQYGYLTLNNGSQALGRSSFYCLEYFPSQM LRTGNNFTFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLVRTQTTGTGGTQTLAF SQAGPSSMANQARNWVPGPCYRQQRVSTTTNQNNNSNFAWTGAAKFKLNGRDSLMN PGVAMASHKDDDDRFFPSSGVLIFGKQGAGNDGVDYSQVLITDEEEIKATNPVATEEYG AVAINNQAANTQAQTGLVHNQGVIPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLM GGFGLKHPPPQILIKNTPVPADPPLTFNQAKLNSFITQYSTGQVSVEIEWELQKENSKRW NPEIQYTSNYYKSTNVDFAVNTEGVYSEPRPIGTRYLTRNL >SEQ ID NO: 14; AAV-L2 MAADGYLPDWLEDTLSEGIRQWWKLKPGPPPPKPAERHKDDSRGLVLPGYKYLGPFN GLDKGEPVNAADAATLEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGXL GRAVFQAKKRVLEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLN FGQTGDSESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWH CDSTWLGDRVITTSTRTWALPTYNNHLYKQISSASTGASNDNHYFGYSTPWGYFDFNR FHCHFSPRDWQRLINNNWGFRPKRLNFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSD SDYQLPYVLGSAHEGCLPPFPADVFMIPQYGYLTLNNGSQALGRSSFYCLEYFPSQMLR TGNNFQFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLVRTQTTGTGGTQTLAFSQ AGPSSMANQARNWVPGPCYRQQRVSTTTNQNNNSNFAWTGAAKFKLNGRDSLMNPG VAMASHKDDDDRFFPSSGVLIFGKQGAGNDGVDYSQVLITDEEEIKATNPVATEEYGA VAINNQAANTQAQTGLVHNQGVIPGMVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMG GFGLKHPPPQILIKNTPVPANPPAEFSATKFASFITQYSTGQVSVEIEWELQKENSKRWNP EVQYTSNYAKSANVDFTVDNNGLYTEPRPIGTRYLTRPL >SEQ ID NO: 15; AAV-L3 MTDGYLPDWLEDNLSEGVREWWALQPGAPKPKANQQKQDDGRGLVLPGYKYLGPFN GLDKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGX LGRAVFQAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRL NFGQTGDSESVPDPQPLGEPPAAPSGVGSLTMASGGGAPVADNNEGADGVGSSSGNW HCDSQWLGDRVITTSTRTWALPTYNNHLYKQISNSTSGGSSNDNAYFGYSTPWGYFDF NRFHCHFSPRDWQRLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFT DSEYQLPYVLGSAHQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQML RTGNNFQFTYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTQTTGGTANTQTLGF SQGGPNTMANQAKNWLPGPCYRQQRVSTTTGQNNNSNFAWTAGTKYHLNGRNSLAN PGIAMATHKDDEERFFPSNGILIFGKQNAARDNADYSDVMLTSEEEIKTTNPVATEEYGI VADNLQQQNTAPQIGTVNSQGALPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMG GFGLKHPPPQILIKNTPVPADPPTTLNQAKLNSFITQYSTGQVSVEIEWELQKENSKRWN PEIQYTSNYYKSTNVDFAVNTEGTYSEPRPIGTRFLTRNL >SEQ ID NO: 16; AAV-L4 MAADGYLPDWLEDTLSEGIRQWWKLKPGPPPPKPAERHKDDSRGLVLPGYKYLGPFN GLDKGEPVNAADAAALEHDKAYDQQLEAGDNPYLRYNHADAEFQERLQEDTSFGGXL GRAVFQAKKRVLEPLGLVEEGAKTAPGKKRPVEQSPQEPDSSSGIGRKGQQPARKRLNF GQTGDSESVPDPQPLGEPPAAPSGVGPNTMASGGGAPMADNNEGADGVGNASGNWH CDSTWLGDRVITTSTRTWALPTYNNHLYKQISSASTGASNDNHYFGYSTPWGYFDFNR FHCHFSPRDWQRLINNNWGFRPKRLNFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDS EYQLPYVLGSAHQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRT GNNFQFTYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQ AGPANMSAQAKNWLPGPCYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRDSLVNPG VAMATHKDDEERFFPSSGVLMFGKQGAGRDNVDYSSVMLTSEEEIKTTNPVATEQYGV VVDNLQQTNTGPIVGNVNSQGALPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMG GFGLKHPPPQILIKNTPVPADPPTAFNKDKLNSFITQYSTGQVSVEIEWELQKENSKRWN PEIQYTSNYYKSNNVEFAVNTEGVYSEPRPIGTRYLTRNL >SEQ ID NO: 17; Barcode S4 GTCTCATGCTATTGCTTCGCATCGTCAGAC >SEQ ID NO: 18; Barcode S5 GCAGGAGTCTTCGGTGCAGTTTGTTACGCA >SEQ ID NO: 19; Barcode S6 TGATTGGACTAGTCGTGGGAGTTAGCATGC >SEQ ID NO: 20; Barcode S9 TTATCTGGCGTGTAGGCAGCGGCCTCAGAC >SEQ ID NO: 21; Barcode S10 TTATACGACTCATATGTGGCATACGTCGTG >SEQ ID NO: 22; Barcode S11 GGAGCCGAGGAAGACTTGTCCGGGGACTGG >SEQ ID NO: 23; Barcode S12 GCCGGGGTGTTGGTCTTTGTCTACGATGGC 

1. A chimeric AAV capsid protein comprising distinct polypeptide regions derived from at least three different AAV serotypes selected from the group consisting of: AAV1, AAV2, AAV4, AAV5, AAV6, AAV8, AAV9, AAVrh8, AAVrh10, AAVrh39, and AAVrh43.
 2. The chimeric AAV capsid protein of claim 1, wherein the chimeric AAV capsid protein comprises distinct polypeptide regions of greater than six amino acids in length that are derived from at least two different AAV serotypes selected from the group consisting of: AAV1, AAV2, AAV4, AAV5, AAV6, AAV8, AAV9, AAVrh8, AAVrh10, AAVrh39, and AAVrh43.
 3. The chimeric AAV capsid protein of claim 1, wherein the amino acid sequence of the protein is represented by SEQ ID NO: 13-16.
 4. The chimeric AAV capsid protein of claim 3, wherein the chimeric AAV capsid is encoded by a sequence as set forth in any one of SEQ ID NO: 9-12.
 5. A rAAV comprising the capsid of claim
 1. 6. The rAAV of claim 5, wherein the rAAV targets the liver tissue of a subject.
 7. The rAAV of claim 6, further comprising a transgene, wherein the transgene is a liver-associated gene.
 8. (canceled)
 9. A barcoded chimeric AAV capsid library comprising a plurality of nucleic acids, wherein each nucleic acid: (i) encodes a unique chimeric AAV capsid protein; and (ii) comprises a unique barcode sequence, wherein barcode sequences in the library have the formula: (NNM)n, wherein n is an integer in the range of is 5 to 20, and wherein N are each nucleotides that are independently selected from A, G, T and C, and wherein M is a nucleotide that is either A or C.
 10. The barcoded chimeric AAV capsid library of claim 9, wherein (i) each unique chimeric AAV capsid protein comprises distinct polypeptide regions of at least three different AAV serotypes; or (ii) (ii) each unique chimeric AAV capsid protein comprises distinct polypeptide regions of greater than six amino acids in length that are derived from at least two different AAV serotypes.
 11. (canceled)
 12. The barcoded chimeric AAV capsid library of claim 9, wherein the barcode sequence comprises a single contiguous stretch of nucleotides inserted into the 3′ untranslated region (UTR) of the nucleic acid sequence encoding the chimeric AAV capsid protein, thereby allowing identification of the capsid protein by detection of the barcode sequence, wherein: (i) the barcode sequence is not interrupted by a primer sequence; or (ii) each barcode sequence does not comprise a Hind III or Xba I restriction endonuclease cleavage sites. 13.-14. (canceled)
 15. A method of producing a chimeric AAV capsid encoding nucleic acid library, the method comprising: (i) preparing a set of chimeric AAV capsid encoding nucleic acids; (ii) preparing a set of constrained, randomized barcode nucleic acids such that number of unique barcode nucleic acids is greater than the number of chimeric AAV capsid encoding nucleic acids; and (iii) combining each chimeric AAV capsid encoding nucleic acid with a unique barcode nucleic acid of the set prepared in (ii) to produce a chimeric AAV capsid encoding nucleic acid library.
 16. The method of claim 15, further comprising: (i) shuffling AAV capsid encoding nucleic acids to obtain a set of chimeric AAV capsid encoding nucleic acids, wherein each nucleic acid of the set encodes a unique chimeric AAV capsid protein having distinct polypeptide regions of at least three different AAV serotypes; and (ii) tagging each chimeric AAV capsid encoding nucleic acid with a unique barcode nucleic acid.
 17. The library of claim 15, wherein each of the barcode nucleic acids in the library have the formula: (NNM)n, wherein n is an integer in the range of is 3 to 20, and wherein N and M are each nucleotides that are independently selected from A, G, T and C.
 18. The method of claim 15, wherein: (i) the chimeric AAV capsid encoding nucleic acids each encode a unique chimeric AAV capsid protein having distinct polypeptide regions of at least three different AAV serotypes; or the chimeric AAV capsid encoding nucleic acids each encode a unique chimeric AAV capsid protein having distinct polypeptide regions of greater than six amino acids in length that are derived from at least two different AAV serotypes.
 19. (canceled)
 20. The method of claim 15, wherein each barcode nucleic acid comprises a single contiguous stretch of nucleotides inserted into the 3′ untranslated region (UTR) of the nucleic acid sequence encoding the chimeric AAV capsid protein, thereby allowing identification of the capsid protein by detection of the barcode sequence.
 21. A method of preparing an AAV library, the method comprising: (i) transfecting AAV packaging cells with the chimeric AAV capsid encoding nucleic acid library of claim 9; (ii) maintaining the cells under conditions that permit production of AAVs incorporating chimeric AAV capsids; and (iii) preparing an AAV library comprising AAVs produced in step (ii)
 22. A method of identifying AAV capsids tropic for a target tissue, the method comprising: (i) administering to a subject AAVs of an AAV library produced by the method of claim 21; (ii) isolating nucleic acids from cells of the target tissue; and (iii) detecting AAV capsid encoding nucleic acids in the isolated nucleic acids.
 23. The method of claim 22, further comprising: (i) amplifying the nucleic acids isolated from the target tissue to facilitate detection of the AAV capsid encoding nucleic acids; or (ii) confirming that the AAV capsid encoding nucleic acids are from the library by detecting barcode sequences.
 24. (canceled)
 25. The method of claim 24, wherein the barcode sequences are detected by colony hybridization, deep sequencing, or PCR. 26.-29. (canceled)
 30. A chimeric AAV capsid protein identified by the method of claim
 22. 